BACKGROUND OF THE INVENTION

This application claims priority to U.S. Provisional Patent Application No.
60/437,928 filed on January 3, 2003, U.S. Provisional Patent Application No. 60/446,942
filed on February 12, 2003, and U.S. Provisional Patent Application No. 60/474,826 filed
on May 30, 2003, all of which are incorporated by reference in their entireties.

The government may own rights in the present invention pursuant to grant
number GM61393 from the National Institutes of Health.

1. Field of the Invention

The present invention relates generally to the fields of molecular genetics,
pharmacogenetics, and cancer therapy. In particular, the present invention is directed to
methods and compositions for detecting polymorphisms and correlating the presence or
absence of certain polymorphisms with toxic effects of chemotherapies. More
specifically, the present invention is directed to methods and compositions for
determining the presence or absence of polymorphisms within a uridine diphosphate
glucuronosyltransferase 1A1 (UGT1A1) promoter and correlating these polymorphisms
with toxic effects of irinotecan, as well as evaluating the risk of an individual for
developing irinotecan toxicity. In some embodiments, the invention concerns methods
and compositions for predicting or anticipating the level of toxicity caused by irinotecan
and other compounds glucuronidated by a UGT enzyme in a patient. Such methods and
compositions can be used to evaluate whether irinotecan-based therapy or therapy
involving a UGT substrate may pose toxicity problems if given to a particular patient.
Alterations in suggested therapy may ensue if a toxicity risk is assessed.

2. Description of Related Art 

Glucuronidation plays a major role in the pharmacological activity and clearance
of a large variety of compounds (Tukey and Strassburg, 2000). Genetic studies of
UDP-glucuronosyltransferases (UGTs) aim to characterize an individual's predisposition to
various diseases and increased risk of adverse outcome to drug treatment. The variation
in the UDP-glucuronosyltransferase 1A1 (UGT1A1) gene is the most extensively studied.
UGT1A1 basal expression is affected by the variable number of TA repeats in the TATA
box, i.e., (TA)n, see U.S. Patent 6,395,481, which is incorporated herein by reference. A
variable number of repeats (5, 6, 7, and 8) have been found in the UGT1A1 TATA box.
Gene transcriptional efficiency has been inversely correlated to the number of TA repeats
(Beutler et al., 1998). Thus, a larger TA repeat number is associated with reduced
transcriptional activity (Beutler et al., 1998) leading to various degrees of impaired
glucuronidation of UGT1A1 substrates.
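
By way of illustration only, the repeat number n of a (TA)nTAA element can be read from a promoter sequence as sketched below in Python. The function name and the flanking bases used in the usage note are hypothetical and are not taken from the UGT1A1 reference sequence. The sketch assumes that the element of interest is the longest run of TA dinucleotides immediately followed by TAA.

```python
import re


def count_ta_repeats(promoter: str) -> int:
    """Return n for the longest (TA)n run that is immediately followed by TAA.

    Returns 0 when no (TA)nTAA element is found. Illustrative helper only.
    """
    seq = promoter.upper()
    best = 0
    # A lookahead is used so that every start position is inspected,
    # including positions inside a longer run.
    for match in re.finditer(r"(?=((?:TA)+)TAA)", seq):
        best = max(best, len(match.group(1)) // 2)
    return best


# Hypothetical flanking sequence around a seven-repeat element.
example = "GCC" + "TA" * 7 + "TAAGTAGG"
repeat_number = count_ta_repeats(example)
```

For the hypothetical sequence above the function reports seven repeats, corresponding to a (TA)7 allele. Such a helper only reports the repeat number; it does not by itself quantify transcriptional activity.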

Homozygosity for the (TA)7 allele is associated with Gilbert's syndrome (a familial
mild hyperbilirubinemia) (Bosma et al., 1995 and Monaghan et al., 1996) and
predisposition to the toxic effects of cancer treatment with irinotecan (Ando et al., 2000
and Iyer et al., 2002). Gilbert's syndrome has also been associated with missense coding
variants in the UGT1A1 gene, in particular in Asian populations where these variants are
relatively common. Increased risk of breast cancer was reported in African-American
women who carried the (TA)7 and (TA)8 alleles (Guillemette et al., 2000). In addition to
the TATA box, Sugatani et al. (2001) identified a region in the UGT1A1 promoter
approximately 3 kb upstream of the TATA box that regulates UGT1A1 inducibility by
phenobarbital. It is also hypothesized that this phenobarbital-responsive enhancer
module (PBREM) might be modulated by endogenous factors (Sugatani et al., 2002).
UGT1A1 activity is probably the result of PBREM-dependent modulation of TATA
box-dependent basal expression.

Polymorphisms in UGT1A1 are relevant to the treatment of cancer patients with
irinotecan. Irinotecan is a topoisomerase I inhibitor that is approved worldwide for the
treatment of metastatic colorectal cancer. Irinotecan has a well-established role as a single
agent in 5-fluorouracil-refractory patients (Rougier et al., 1998; Cunningham et al.,
1998), as well as in combination with 5-fluorouracil/leucovorin as a first-line therapy
(Saltz et al., 2000; Rothenberg et al., 2001).

Irinotecan hydrolysis by carboxylesterase-2 is responsible for its activation to
SN-38 (7-ethyl-10-hydroxycamptothecin), a topoisomerase I inhibitor of much higher
potency than irinotecan. The main inactivating pathway of irinotecan is the
biotransformation of active SN-38 into inactive SN-38 glucuronide (SN-38G).
Interpatient differences in systemic formation of SN-38G have been shown to have clear
clinical consequences in patients treated with irinotecan. Patients with higher
glucuronidation of SN-38 are more likely to be protected from the dose-limiting toxicity
of diarrhea in the weekly schedule (Gupta et al., 1994). SN-38 is glucuronidated by
UDP-glucuronosyltransferase 1A1 (UGT1A1) (Iyer et al., 1997).
Despite its efficacy in treating metastatic colon cancer and its broad spectrum of
activity in other tumor types, irinotecan treatment is associated with significant toxicity.
SN-38 is an active metabolite of irinotecan, and SN-38 glucuronidation represents a
mechanism to protect patients from the toxic effects of irinotecan (Gupta et al., 1994).
Reduced SN-38 glucuronidation is thought to underlie the severe toxicity associated with
irinotecan treatment in some patients (Gupta et al., 1994). The main severe toxicities of
irinotecan are delayed diarrhea and myelosuppression. In the early single-agent trials,
grade 3-4 diarrhea occurred in about one third of patients and was dose limiting (Negoro
et al., 1991; Rothenberg et al., 1993). Its frequency varies from study to study and is also
schedule dependent. The frequency of grade 3-4 diarrhea in the three-weekly regimen
(19%) is significantly lower than in the weekly schedule (36%, Fuchs et al., 2003).
In addition to diarrhea, grade 3-4 neutropenia is also a common adverse event, with about
30-40% of the patients experiencing it in both weekly and three-weekly regimens (Fuchs
et al., 2003; Vanhoefer et al., 2001). Fatal events during irinotecan treatment have been
reported. High mortality rates of 5.3% and 1.6% were reported in the weekly and
three-weekly single-agent irinotecan regimens, respectively (Fuchs et al., 2003).

Although retrospective analysis of UGT1A1 genetic variation in relation to severe
toxicity after different irinotecan-based regimens has been conducted in Japanese patients
(Ando et al., 2000), prospective evaluation in a large trial has not been performed.

Thus, the problem of identifying the effects of various promoter polymorphism
combinations on the expression of UGT1A1 for the determination of UGT activity levels
remains. Improved methods and compositions for the evaluation of risk for irinotecan
toxicity in an individual or patient are still needed.

SUMMARY OF THE INVENTION 

Metabolism of SN-38, an active metabolite of irinotecan, via glucuronidation
represents a mechanism to protect patients from the toxic effects of irinotecan; thus, a
reduction in SN-38 glucuronidation contributes to the probability that toxicity associated
with irinotecan may be experienced in patients. While some genetic bases for reduced
SN-38 glucuronidation have been identified, others have yet to be identified.
Therefore, there remains a need for improved methods and compositions for evaluating
polymorphisms in one or both UGT1A1 genes of a patient and correlating a genotype
with adverse effects of various therapies.

The present invention is based on the fact that genetic variation is correlated with
UGT1A1 expression and has several important clinical implications. The improved
methods and compositions of the present invention may be used in determining if a
treatment has a propensity to adversely affect a patient or what treatment may be
appropriate or inappropriate for a particular patient. UGT1A1 basal transcription is
affected by a polymorphic (TA) repeat (see Fig. 1 legend in Innocenti et al., 2002), in
addition to a phenobarbital-responsive enhancer module (PBREM) that contains variants
affecting inducible gene expression, as described herein. A "polymorphism" or "genetic
polymorphism," as referred to herein, is the existence of two or more variant forms of a
particular characteristic, e.g., a single nucleotide or a repeat of a nucleotide or nucleotides.
Generally, variations are due to the addition, deletion, or substitution of one or more
nucleotides at a site or a variation in the number of tandem repeats of a DNA sequence.
In various embodiments, other polymorphisms within or outside the UGT1 gene locus,
see Genbank accession number AF297093 which is incorporated herein by reference,
may be used as long as an association of a polymorphism with a particular phenotype
and/or haplotype can be established. Exemplary methods for genotyping a UGT1A gene
may be found at least in U.S. Patents 6,479,236, 6,472,157 and 6,395,481, each of which
is incorporated herein by reference.

In various embodiments of the invention, significant linkage disequilibrium
between a (TA) polymorphism and variants in the PBREM, or other variants within or
outside the UGT1 gene locus, indicates that patients possessing such other variants may
be at risk of irinotecan toxicity. "Significant" as used in respect to linkage
disequilibrium, as determined by one of skill in the art, is contemplated to be a statistical
p or α value that may be 0.25 or 0.1 and may be 0.1, 0.05, 0.001, 0.00001 or less.
"Linkage disequilibrium" ("LD" as used herein, though also referred to as "LED" in the
art) refers to a situation where a particular combination of alleles (i.e., variant forms of a
given gene) or polymorphisms at two loci appears more frequently than would be
expected by chance. The relationship between PBREM-(TA)n haplotypes and the
glucuronidation rate of the UGT1A1 substrate SN-38 may be used to correlate the
genotype (i.e., the genetic make up of an organism) to a phenotype (i.e., the physical
traits displayed by an organism or cell). "Haplotype" is used herein to refer to a
collective genotype of two or more closely linked loci. Each haplotype defines the
sequence of alleles or polymorphisms along one of the homologous chromosomes. In
some embodiments, the polymorphisms may be within 0.001, 0.01, 0.1, 0.2 cM or more of
one another.
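
As a hedged illustration of the definition above, the following Python sketch computes three commonly used pairwise measures of linkage disequilibrium (D, |D'| and r squared) from a haplotype frequency and two allele frequencies. These are standard population-genetics quantities and are offered as an assumption about how the degree of LD could be summarized; they are not asserted to be the statistical method used in the Examples, and the function name is hypothetical.

```python
def linkage_disequilibrium(p_ab: float, p_a: float, p_b: float):
    """Return (D, |D'|, r^2) for two biallelic loci.

    p_ab: frequency of the haplotype carrying allele A at locus 1 and B at locus 2
    p_a:  frequency of allele A at locus 1
    p_b:  frequency of allele B at locus 2
    """
    # D is the excess of the observed haplotype frequency over the
    # frequency expected if the two loci were independent.
    d = p_ab - p_a * p_b
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max if d_max > 0 else 0.0
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    r_squared = (d * d) / denom if denom > 0 else 0.0
    return d, d_prime, r_squared


# Complete LD: allele A always travels with allele B.
complete = linkage_disequilibrium(0.5, 0.5, 0.5)
# Independence: the haplotype occurs exactly as often as chance predicts.
independent = linkage_disequilibrium(0.25, 0.5, 0.5)
```

In the first call both |D'| and r squared equal 1; in the second, all three measures are 0, which corresponds to alleles that co-occur no more often than expected by chance.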

Various embodiments of the invention include methods for evaluating the risk of
toxicity from irinotecan, or other UGT1A1 substrates, in a patient. A polymorphism may
be a single nucleotide polymorphism (SNP) and may be in linkage disequilibrium with a
(TA)n repeat. In certain embodiments, the methods include detecting one or more
polymorphisms in one or both copies of the UGT1A1 gene and/or one or both copies of
any other gene located in the UGT1 gene locus of a patient. In particular embodiments a
promoter polymorphism is detected. It is specifically contemplated that methods and
compositions of the invention may be implemented to determine whether UGT1A1
polymorphisms are present or absent in one or both alleles.

In certain embodiments, a polymorphism may be a polymorphism that affects the
transcription of UGT1A1, such as in the promoter region or 5' flanking region that affects
transcription (which includes the promoter region), and in particular a polymorphism at
nucleotide position -3440, -3401, -3279, -3177, -3175, or -3156 from the UGT1A1 gene
transcriptional start site, which is designated +1 with no nucleotide designated as 0. The
number of TA repeats can be 5, 6, 7, 8 or more. In particular embodiments,
the polymorphism is the following: -3440C>A, -3401T>C, -3279G>T, -3177C>G,
-3175A>G, -3156G>A, or any combination thereof. The notation -3440C>A, for example,
indicates that the cytosine nucleotide (C) at the -3440 position is replaced by an adenosine
(A).
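
The notation and numbering convention described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the function names and the array index used for the transcriptional start site are hypothetical, and the only rules encoded are those given in the text (the start site is +1 and no nucleotide is designated 0).

```python
import re

# Matches notations such as "-3440C>A": signed position, reference base, ">", variant base.
_VARIANT = re.compile(r"^([+-]?\d+)([ACGT])>([ACGT])$")


def parse_variant(notation: str):
    """Split a notation like '-3440C>A' into (position, reference, variant)."""
    match = _VARIANT.match(notation.replace(" ", "").upper())
    if match is None:
        raise ValueError(f"unrecognized variant notation: {notation!r}")
    position = int(match.group(1))
    if position == 0:
        raise ValueError("no nucleotide is designated as position 0")
    return position, match.group(2), match.group(3)


def to_index(position: int, tss_index: int) -> int:
    """Convert a start-site-relative position to a 0-based sequence index.

    tss_index is the 0-based index of the +1 nucleotide in the sequence
    being examined (a hypothetical value supplied by the caller).
    """
    if position == 0:
        raise ValueError("no nucleotide is designated as position 0")
    # Because there is no position 0, -1 lies immediately before +1.
    return tss_index + position - 1 if position > 0 else tss_index + position


position, reference, variant = parse_variant("-3440C>A")
```

With a hypothetical start-site index of 4000, position +1 maps to index 4000, position -1 to index 3999, and position -3440 to index 560.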

Methods of the invention may include obtaining a nucleic acid sample from a
patient and detecting one or more polymorphisms in the UGT1A1 gene using various
methods. In certain embodiments, polymorphism detection may include amplifying a
nucleic acid containing all or part of a particular region of the UGT1A1 gene to obtain
amplification products; and/or analyzing the amplification products for the presence or
absence of one or more polymorphisms. Other methods of polymorphism detection
known in the art are also contemplated.

In certain embodiments, a promoter polymorphism of a UGT1A1 gene may be
detected by performing one of a variety of known assays. These may include, but are not
limited to, hybridization assays, sequencing or microsequencing assays, allele-specific
amplification assays or any other methods known for detecting nucleic acid
polymorphisms, which may or may not include amplification of a nucleic acid. It is
understood that "detecting" a polymorphism includes identifying the nucleotide sequence
at that site and/or determining whether the polymorphism is present or absent.

A correlation between one or more polymorphisms and the glucuronidation rate
of irinotecan or other substrates of UGT1A1, including but not limited to bilirubin,
estriol, beta-estradiol, 2-hydroxyestriol, 2-hydroxyestrone, 2-hydroxyestradiol, thyroxine
(T4), rT3, octyl gallate, propyl gallate, anthraflavic acid, quercetin, fisetin, naringenin,
1-naphthol, and ethynylestradiol, may be used to determine various aspects of a treatment
regime, including irinotecan and/or other drugs or compounds metabolized directly or
indirectly by UGT1A1. In some embodiments the methods also include analyzing the
glucuronidation rate associated with the various polymorphisms and polymorphism
combinations; for exemplary methods and compositions related to analysis of
glucuronidation rates see U.S. Patent 6,319,678, which is incorporated herein by
reference. The methods may also include determining the biliary transport capacity of
the patient. In particular embodiments the evaluation of the promoter polymorphism may
be used to optimize the dose of irinotecan or other compounds for treatment of a patient
or to reduce their toxicity.

The methods of the invention may further include treating a patient by
administering to the patient irinotecan in combination with other pharmaceutical agents at
appropriate dosages, such that the toxicity of irinotecan or other substrates of UGT1A1
is reduced. In particular embodiments, a second agent that reduces excretion of an
active irinotecan species through the bile may be administered in conjunction with
irinotecan based upon determinations made using methods and compositions of the
invention; for related methods and compositions see U.S. Patents 6,407,117, 6,287,834
and 5,786,344, each of which is incorporated herein by reference.

The present invention is also based on the observation that the nucleotide at
position -3516 in the UGT1A1 upstream region is correlated with irinotecan toxicity. An
A at that position positively correlates with irinotecan toxicity while a G at that position
correlates with tolerance to irinotecan. Thus, the present invention concerns methods and
compositions for evaluating, predicting, and determining whether a patient will
experience toxicity from irinotecan. Toxicity from irinotecan evidences itself as side
effects from the drug, which are well known to oncologists and their patients.

In some embodiments of the invention, there are methods of predicting whether a
patient may suffer or be subject to toxicity from irinotecan if given it, which involve
determining the nucleic acid sequence of base -3516 in the UGT1A1 5' flanking region in
one or both alleles of the patient. The presence of an A nucleotide indicates the person is
at risk for irinotecan toxicity. An AA genotype is more closely correlated with grade 4
neutropenia than other genotypes at that position. Moreover, in some embodiments, this
is unrelated to the genotype of the TA indel in the UGT1A1 promoter. It is contemplated
that these methods concerning the variant at position -3516 in the UGT1A1 5' flanking
region can be implemented with methods involving determining one or more other
polymorphisms in the UGT1A1 5' flanking region of the same patient.
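
The qualitative relationship stated above can be encoded, purely as an illustrative sketch, in the following Python function. It only restates the text (an A at -3516 indicates risk; AA is the genotype most closely correlated with grade 4 neutropenia) and is not a clinical decision rule. The function name, the input format and the output field names are hypothetical.

```python
def classify_minus_3516(genotype: str) -> dict:
    """Summarize a -3516 genotype given as two bases, e.g. 'AG', 'A/G' or 'gg'."""
    alleles = genotype.upper().replace("/", "").strip()
    if len(alleles) != 2 or any(base not in "AG" for base in alleles):
        raise ValueError(f"expected two alleles drawn from A and G, got {genotype!r}")
    a_count = alleles.count("A")
    return {
        "a_alleles": a_count,            # number of A alleles (0, 1 or 2)
        "at_risk": a_count >= 1,         # presence of an A indicates risk
        "homozygous_a": a_count == 2,    # AA genotype
    }


heterozygote = classify_minus_3516("A/G")
```

A result of this kind could be combined with other genotypes, such as the TA indel, when more than one polymorphism in the 5' flanking region is evaluated for the same patient.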

Consequently, if a person is identified as at risk for irinotecan toxicity based on
any of the embodiments discussed herein, an alternative course of therapy or a lower dose
of irinotecan than is normally given may be contemplated. In addition, methods also
include determining the sequence of other polymorphisms or indels (insertion/deletions)
in linkage disequilibrium (LD) with the -3516 variant. Therefore, in some embodiments
of the invention, the TA indel is evaluated to determine the number of repeats. Also, any
other variant in UGT1A1 or any other gene (the term "gene" includes non-coding regions
that affect the expression or activity level of the encoded polypeptide) may be evaluated
for variants in LD with the -3516 variant.

Various embodiments may include a kit for evaluating the risk of irinotecan
toxicity in a patient. The kit may include a variety of containers, reagents and the like.
In certain embodiments, the kit may include an oligonucleotide primer to amplify a
promoter region of a UGT1A1 gene or genes, haplotype tag SNPs or allele specific
amplification primers of the UGT1A1 gene or any other primer within the UGT1 gene
locus. The haplotype tag SNPs or allele specific primers may be used to amplify a
polymorphism at one or more nucleotide positions of the UGT1A1 gene or other UGT1
locus gene. In particular embodiments, the nucleotide position may be at -3440, -3401,
-3279, -3177, -3175, or -3156, or a combination thereof, from the UGT1A1 gene
transcriptional start site. The kit may include the haplotype tag SNPs or allele specific
amplification primers in a multi-well assay plate. The kit may also include haplotype tag
SNPs or allele specific hybridization probes for a variety of promoter polymorphisms.
The haplotype tag SNPs or allele specific hybridization probes may detect
polymorphisms at nucleotide position -3440, -3401, -3279, -3177, -3175, or -3156 from
the UGT1A1 gene transcriptional start site. The kit may include haplotype tag SNPs or
allele specific hybridization probes comprised in an oligonucleotide array or microarray.

Compositions of the invention include nucleic acids that can be used to determine
the sequence at position -3516 of UGT1A1 or other reagents in that regard. Arrays and
other assays for screening multiple samples are also included as part of the invention.
Such compositions may be incorporated into kits or as part of a kit, along with any other
composition discussed herein.

It is contemplated that any method or composition described herein can be
implemented with respect to any other method or composition described herein.
Similarly, any embodiment discussed with respect to one aspect of the invention may be
used in the context of any other aspect of the invention.

Throughout this application, the term "about" is used to indicate that a value
includes the standard deviation of error for the device or method being employed to
determine the value.

The use of the word "a" or "an" when used in conjunction with the term
"comprising" in the claims and/or the specification may mean "one," but it is also
consistent with the meaning of "one or more," "at least one," and "one or more than one."

Other objects, features and advantages of the present invention will become
apparent from the following detailed description. It should be understood, however, that
the detailed description and the specific examples, while indicating specific embodiments
of the invention, are given by way of illustration only, since various changes and
modifications within the spirit and scope of the invention will become apparent to those
skilled in the art from this detailed description.

BRIEF DESCRIPTION OF THE DRAWINGS 

The following drawings form part of the present specification and are included to
further demonstrate certain aspects of the present invention. The invention may be better
understood by reference to one or more of these drawings in combination with the
detailed description of specific embodiments presented herein.

FIG. 1 illustrates an exemplary phenobarbital-responsive enhancer module
(PBREM) and description of polymorphisms. The previously described PBREM
domains are underlined with the NR half-site sequences shown in bold. The polymorphic
sites of the present application are included. The variants found at these sites are also
listed. Positions indicated are from the first base of the UGT1A cluster sequence
(Genbank accession No. AF297093).

FIGS. 2A-2D illustrate the (TA)n genotype-phenotype relationship in human livers:
(a) correlation in all samples investigated (n=83); (b) correlation in Caucasians (n=56); (c)
correlation in African-Americans (n=15); (d) correlation in individuals of Asian (n=1)
and unknown (n=10) ethnicity. Liver microsomes were phenotyped for SN-38
glucuronidation rates in each liver with a single experiment performed in triplicate. Bars
show the mean value of SN-38 glucuronidation rates in each group.

FIG. 3 illustrates an exemplary haplotype-phenotype relationship in human livers
of Caucasian and African origin (n=70). Bars show the mean value of SN-38
glucuronidation rates in each group. Only haplotypes with >2 samples are shown.

FIG. 4. Correlation between ANC and TA indel genotype. Bars represent the 
means. Nonparametric trend analysis (7/7<6/7<6/6, z = -2.72, p=0.01). 

FIG. 5. Pre-treatment total bilirubin levels and distribution of the -3156
genotypes within each TA indel genotype. The -3156 AA genotypes are reported in
squares, the GA genotypes in circles and the GG genotypes in triangles. Bars represent
the mean values. A significant trend was reported (7/7>6/7>6/6, z=2.88, p<0.01,
nonparametric trend analysis).

FIG. 6. Correlation between ln(ANC nadir) and pretreatment total bilirubin
levels. Patients with bilirubin levels less than 0.6 mg/dl are depicted in squares. Those
with bilirubin levels higher than 0.7 mg/dl are depicted in circles.

DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 

The present invention provides improved methods and compositions for
identifying the effects of various polymorphisms, promoter polymorphisms or any
combination thereof on the expression of UGT1A1 or the glucuronidation rate of
UGT1A1 for the evaluation of the potential or risk for irinotecan toxicity in an individual
or patient. The development of these improved methods and compositions allows for the
use of such an evaluation to optimize treatment of a patient and to lower the risk of
toxicity. In certain aspects of the invention various combinations of promoter
polymorphisms may be used in this evaluation; in particular, polymorphisms in the
PBREM region and polymorphisms in the TA repeats may be used.

Genetic variation in UGT1A1 expression has several important clinical
implications. UGT1A1 basal transcription is affected by a polymorphic (TA) repeat.
Another important regulatory element is the phenobarbital-responsive enhancer module
(PBREM), which may contain variants affecting inducible gene expression. The
examples provided herein study the extent of linkage disequilibrium between the (TA)
polymorphism and variants in the PBREM and UGT1A1 promoter. The relationship
between PBREM-(TA)n haplotypes and the glucuronidation rate of the UGT1A1 substrate
SN-38 is also addressed herein. Studies described in the Examples illustrate that SN-38G
formation rates were correlated with (TA) genotypes and promoter variants. In various
aspects, particular (TA) variants are in linkage disequilibrium with various other
polymorphisms.

Certain aspects of the invention are based on, but not limited to, the observation
and characterization of novel polymorphisms in the PBREM region of the UGT1A1 gene.
Due to the clinical implications of genetically modified regulation of UGT1A1 activity,
the PBREM region was sequenced and polymorphisms in the TATA box of the UGT1A1
promoter genotyped, as described in the Examples section below.

I. HEPATIC GLUCURONIDATION BY UGT ENZYMES 

Hepatic glucuronidation results from the activities of a multigene family of UGT
enzymes, the members of which exhibit specificity for a variety of endogenous substrates
and xenobiotics. The UGT enzymes are broadly classified into two distinct gene
families. The UGT1 locus codes for multiple isoforms of UGT, all of which share a
C-terminus encoded by a unique set of exons 2-5, but which have a variable N-terminus
encoded by different first exons, each with its own independent promoter (Bosma et al.,
1992; Ritter et al., 1992). The variable first exons confer the substrate specificity of the
enzyme. Isoforms of the UGT2 family are unique gene products of which at least eight
isozymes have been identified (Clarke et al., Handbook of Experimental Pharmacology,
1994). The UGT1A1 isoform is the major bilirubin glucuronidation enzyme. Genetic
defects in the UGT1A1 gene can result in decreased glucuronidation activity which leads
to abnormally high levels of unconjugated serum bilirubin that may enter the brain and
cause encephalopathy and kernicterus (Owens & Ritter, 1995). This condition is
commonly known as Gilbert's syndrome. The molecular defect in Gilbert's syndrome is
a change in the TATA box within the UGT1A1 promoter (Bosma et al., 1995 and
Monaghan et al., 1996). This promoter usually contains a (TA)6TAA element, but
another allele, termed UGT1A1*28 or allele 7, is also present in human populations at
high frequencies, and contains the sequence (TA)7TAA. This polymorphism in the
promoter of the UGT1A1 gene results in reduced expression of the gene and accounts for
most cases of Gilbert's syndrome (Bosma et al., 1995). Overall, gene expression levels
for the UGT1A1 promoter alleles are inversely related to the length of the TA repeat in
the TATA box.

The variation observed in this promoter may also account for the inter-individual
and inter-ethnic variation in drug metabolism and response to xenobiotic exposure.
UGTs have been shown to contribute to the detoxification and elimination of both
exogenous and endogenous compounds. For example, one typical role of the UGT1A1
isoform is the glucuronidation of SN-38 (7-ethyl-10-hydroxycamptothecin) to the
corresponding glucuronide (10-O-glucuronyl-SN-38, SN-38G) as well as the
glucuronidation of TAS-103 (6-[[2-(dimethylamino)ethyl]amino]-3-hydroxy-7H-indeno[2,1-c]quinoline-7-one
dihydrochloride) to its corresponding glucuronide (TAS-103G).
SN-38 is the active form of irinotecan (CPT-11,
7-ethyl-10-[4-(1-piperidino)-1-piperidino]carbonyloxycamptothecin), which is a
camptothecin derivative used in the treatment of metastatic colorectal cancer and other
malignancies. The metabolism of SN-38 and TAS-103 (also known as flavopiridol) is
merely illustrative of the present invention; the metabolism of other UGT1A1 substrates
is also contemplated, such as estradiol, bilirubin, simple phenols, flavones, C18 steroids,
complex phenols and coumarins.

Irinotecan is biotransformed by tissue and serum carboxylesterases to an active
metabolite, SN-38, which has a 100-1,000-fold higher antitumor activity than irinotecan.
SN-38 is glucuronidated by hepatic uridine diphosphate glucuronosyltransferases (UGTs)
to form SN-38 glucuronide (10-O-glucuronyl-SN-38, SN-38G), which is inactive and
excreted into the bile and urine, although SN-38G might be deconjugated to form SN-38
by the intestinal β-glucuronidase enzyme (Kaneda et al., 1990).

The major dose-limiting toxicities of irinotecan include diarrhea and, to a lesser
extent, myelosuppression. Irinotecan-induced diarrhea can be serious and often does not
respond adequately to conventional antidiarrheal agents (Takasuna et al., 1995). This
diarrhea may be due to direct enteric injury caused by the active metabolite, SN-38,
which has been shown to accumulate in the intestine after intraperitoneal administration
of irinotecan in athymic mice (Araki et al., 1993). The results of a recently completed
phase I clinical trial demonstrated that there was an inverse relationship between SN-38
glucuronidation rates and severity of diarrheal incidences in patients treated with
increasing doses of irinotecan (Gupta et al., 1994). These findings indicate that
glucuronidation of SN-38 protects against irinotecan-induced gastrointestinal toxicity. A
complete discussion of the correlation between diarrhea and SN-38 glucuronidation, as
well as a description of biochemical methods for determining glucuronidation levels, can
be found in US Patent 5,786,344 and WO96/01127, which are both incorporated herein by
reference in their entirety. Likewise, the results of studies using TAS-103 demonstrate
that glucuronidation of TAS-103 may protect against TAS-103 induced toxicity.

Therefore, the conversion of these two toxic compounds by hepatic UGTs demonstrates
the importance of monitoring glucuronidation activity as an indicator of susceptibility to
toxicity caused by exposure to compounds that are metabolized by UGTs. Furthermore,
differential rates of SN-38 glucuronidation among subjects may explain the considerable
inter-individual variation in the pharmacokinetic parameter estimates and toxicities
observed after treatment with anti-cancer drugs or exposure to xenobiotics (Gupta et al.,
1994; Gupta et al., 1997).

When two species, Gunn rats (Gunn, 1938) and CN-1 patients, that are deficient
in UGT1A isoforms were screened for TAS-103 and SN-38 glucuronidation activity,
there was approximately an 80% lower glucuronidation rate of TAS-103 in vitro and no
in vitro glucuronidation of SN-38 compared to healthy liver donors. These results
demonstrate the role of the UGT1 family in catalyzing SN-38 and TAS-103 conjugation.
Furthermore, these results demonstrate that the UGT2 family does not play a role in the
glucuronidation of SN-38. On the other hand, while isoforms of the UGT1 family are the
predominant isoforms involved in TAS-103 glucuronidation, isoforms of the UGT2
family may also participate in TAS-103 glucuronidation. Failure to glucuronidate SN-38
and TAS-103 in these instances may result specifically from the genetic defect in the UGT1
gene family.

Other experiments confirm the association between the UGT1A1 isoform and
SN-38 and TAS-103 glucuronidation. These studies show that substantial genetic variability
exists in the UGT1A1 isoform family and particularly in the UGT1A1 promoter. This
genetic variability has been shown to correlate with gene expression. For example, the
presence of the 5 allele in the UGT1A1 promoter leads to increased gene expression while
the presence of the 8 allele leads to reduced gene expression. Differences in gene
expression levels may give rise to individuals with varying abilities to glucuronidate
compounds metabolized by UGTs. This prediction was confirmed through a correlation
analysis of UGT1A1 promoter genotype and rate of in vitro SN-38 and TAS-103
glucuronidation.

It follows, therefore, that individuals with the 8 allele may also have differing
susceptibility to xenobiotics when compared to other genotypes when those compounds
are metabolized by UGT1A1. On the other hand, the presence of the 5 allele that
correlates with increased gene expression and higher glucuronidation activity may result
in the administration of less than optimum drug dosages. For example, when a drug
metabolized by UGT1A1 is administered to an individual with this polymorphism, the
increased glucuronidation activity may cause more of the drug to be converted into the
inactive metabolite in a shorter period, thereby reducing the drug's effectiveness.
Conversely, in the rare case of drugs and xenobiotics that require glucuronidation for
activation, decreased glucuronidation activity may cause less of the activated form of the
drug or xenobiotic to be available.

The fact that repeated sequences are intrinsically unstable and tend to lengthen
and shorten as a result of unequal crossing-over during meiosis may explain the presence
of other alleles, in addition to (TA)6 and (TA)7, in the population. Two additional alleles
have been identified in human populations: allele 5, containing the sequence (TA)5TAA,
and allele 8, containing the sequence (TA)8TAA; see U.S. Patent 6,395,481, which is
incorporated in its entirety by reference. Interestingly, alleles 5 and 8 were found
predominantly in population samples from Sub-Saharan Africa, where they occur at
lower frequencies than the common alleles 6 and 7, although it is possible that these two
alleles are present across a variety of ethnic groups. The frequency of alleles 6 and 7 also
appears to differ significantly across ethnic groups, with Asian and Amerindian
populations showing the highest frequencies of allele 6. Conversely, alleles 6 and 7 occur
at intermediate and similar frequencies among Caucasians and Sub-Saharan Africans.

Several hypotheses may be proposed regarding the selective pressures that might
be responsible for the observed pattern of inter-population variation at the UGT1A1
promoter. It was previously proposed that the maintenance of intermediate levels of
bilirubin is adaptive (Beutler et al., 1998), and that the alleles at this promoter would be
maintained in the population by balancing selection. This hypothesis is based on the
observation that bilirubin is a potent antioxidant likely to have physiological significance
(Stocker et al., 1987). However, it is also known that glucuronidation is an important
detoxification step for many endogenous as well as exogenous compounds (Clarke &
Burchell, 1994). In addition to TAS-103 and SN-38, UGT1A1 is likely to act on other
substrates present in the environment, e.g., dietary components, environmental pollutants
and carcinogens, which require detoxification, as well as playing a role in the metabolism
of bilirubin and other endogenous compounds. Within this framework, maintaining high
levels of UGT1A1 gene expression would ensure rapid elimination of toxic or
endogenous compounds and be advantageous.

As described herein, the correlation between in vitro glucuronidation rate and
UGT1A1 promoter polymorphism found for alleles 6 and 7 has been shown to extend to
alleles 5 and 8. Because these alleles appear to be more frequent in subsets of human
populations (for example, those of African origin), an even higher inter-individual
variability in SN-38 and TAS-103 metabolism might be expected within these
populations. Because the inverse relationship between TA repeat size and rate of SN-38
glucuronidation extends to alleles 5 and 8, a screening assay that identifies these alleles
can facilitate individualization of drug therapy, identify individuals susceptible to
xenobiotic exposure, and improve drug dosage calculations.
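The allele nomenclature above, in which allele n contains the sequence (TA)nTAA, can be illustrated with a minimal Python sketch. This is an illustration only and not the assay of the invention: the function names, the flanking bases in the example, and the use of a regular expression are assumptions made for clarity, and the sketch counts repeats in a sequence read rather than performing any laboratory genotyping step.

```python
import re


def count_ta_repeats(promoter_fragment):
    """Return n for the longest (TA)nTAA run in the fragment, or 0 if none is found."""
    best = 0
    # (TA)+ is greedy; the regex engine backtracks so that the final "TAA" still matches.
    for match in re.finditer(r"((?:TA)+)TAA", promoter_fragment.upper()):
        best = max(best, len(match.group(1)) // 2)
    return best


def call_genotype(allele_a, allele_b):
    """Return the repeat counts of both alleles, smaller count first."""
    return tuple(sorted((count_ta_repeats(allele_a), count_ta_repeats(allele_b))))


# Hypothetical reads; the flanking bases are placeholders, not UGT1A1 sequence.
read_6 = "GCC" + "TA" * 6 + "TAA" + "GT"
read_7 = "GCC" + "TA" * 7 + "TAA" + "GT"
genotype = call_genotype(read_7, read_6)
```

Under the inverse relationship described above, a genotype with a smaller total repeat count would be expected to show a higher glucuronidation rate; the sketch reports only the repeat counts and does not predict a rate.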

II. NUCLEIC ACIDS 

Certain embodiments of the present invention concern various nucleic acids,
including promoters, amplification primers, oligonucleotide probes and other nucleic acid
elements involved in the analysis of genomic DNA. In certain aspects, a nucleic acid
comprises a wild-type, a mutant, or a polymorphic nucleic acid.

The term "nucleic acid" is well known in the art. A "nucleic acid" as used herein
will generally refer to a molecule (i.e., a strand) of DNA, RNA or a derivative or analog
thereof, comprising a nucleobase. A nucleobase includes, for example, a naturally
occurring purine or pyrimidine base found in DNA (e.g., an adenine "A," a guanine "G,"
a thymine "T" or a cytosine "C") or RNA (e.g., an A, a G, a uracil "U" or a C). The
term "nucleic acid" encompasses the terms "oligonucleotide" and "polynucleotide," each as
a subgenus of the term "nucleic acid." The term "oligonucleotide" refers to a molecule of
between about 3 and about 100 nucleobases in length. The term "polynucleotide" refers
to at least one molecule of greater than about 100 nucleobases in length. A "gene" refers
to the coding sequence of a gene product, as well as introns and the promoter of the gene
product. In addition to the UGT1A1 gene, other regulatory regions such as enhancers for
UGT1A1 are contemplated as nucleic acids for use with compositions and methods of the
claimed invention.

These definitions generally refer to a single-stranded molecule, but in specific
embodiments will also encompass an additional strand that is partially, substantially or
fully complementary to the single-stranded molecule. Thus, a nucleic acid may
encompass a double-stranded molecule or a triple-stranded molecule that comprises one
or more complementary strand(s) or "complement(s)" of a particular sequence
comprising a molecule. As used herein, a single stranded nucleic acid may be denoted by
the prefix "ss", a double stranded nucleic acid by the prefix "ds", and a triple stranded
nucleic acid by the prefix "ts."

In particular aspects, a nucleic acid encodes a protein, polypeptide, or peptide. In
certain embodiments, the present invention concerns novel compositions comprising at
least one proteinaceous molecule. As used herein, a "proteinaceous molecule,"
"proteinaceous composition," "proteinaceous compound," "proteinaceous chain," or
"proteinaceous material" generally refers, but is not limited to, a protein of greater than
about 200 amino acids or the full length endogenous sequence translated from a gene; a
polypeptide of greater than about 100 amino acids; and/or a peptide of from about 3 to
about 100 amino acids. All the "proteinaceous" terms described above may be used
interchangeably herein.

1. Preparation of Nucleic Acids

A nucleic acid may be made by any technique known to one of ordinary skill in
the art, such as, for example, chemical synthesis, enzymatic production or biological
production. Non-limiting examples of a synthetic nucleic acid (e.g., a synthetic
oligonucleotide) include a nucleic acid made by in vitro chemical synthesis using
phosphotriester, phosphite or phosphoramidite chemistry and solid phase techniques such
as described in European Patent 266,032, incorporated herein by reference, or via
deoxynucleoside H-phosphonate intermediates as described by Froehler et al., 1986 and
U.S. Patent 5,705,629, each incorporated herein by reference. In the methods of the
present invention, one or more oligonucleotides may be used. Various
mechanisms of oligonucleotide synthesis have been disclosed in, for example, U.S.
Patents 4,659,774, 4,816,571, 5,141,813, 5,264,566, 4,959,463, 5,428,148, 5,554,744,
5,574,146, 5,602,244, each of which is incorporated herein by reference.

A non-limiting example of an enzymatically produced nucleic acid includes one
produced by enzymes in amplification reactions such as PCR™ (see for example, U.S.
Patent 4,683,202 and U.S. Patent 4,683,195, each incorporated herein by reference), or
the synthesis of an oligonucleotide described in U.S. Patent 5,645,897, incorporated
herein by reference. A non-limiting example of a biologically produced nucleic acid
includes a recombinant nucleic acid produced (i.e., replicated) in a living cell, such as a
recombinant DNA vector replicated in bacteria (see for example, Sambrook et al., 2001,
incorporated herein by reference).

2. Purification of Nucleic Acids 

A nucleic acid may be purified on polyacrylamide gels, cesium chloride
centrifugation gradients, chromatography columns or by any other means known to one
of ordinary skill in the art (see for example, Sambrook et al., 2001, incorporated herein
by reference). In some aspects, a nucleic acid is a pharmacologically acceptable nucleic
acid. Pharmacologically acceptable compositions are known to those of skill in the art,
and are described herein.

In certain aspects, the present invention concerns a nucleic acid that is an isolated
nucleic acid. As used herein, the term "isolated nucleic acid" refers to a nucleic acid
molecule (e.g., an RNA or DNA molecule) that has been isolated free of, or is otherwise
free of, the bulk of the total genomic and transcribed nucleic acids of one or more cells.
In certain embodiments, "isolated nucleic acid" refers to a nucleic acid that has been
isolated free of, or is otherwise free of, the bulk of cellular components or in vitro reaction
components such as, for example, macromolecules such as lipids or proteins, small
biological molecules, and the like.

3. Nucleic Acid Segments 

In certain embodiments, the nucleic acid is a nucleic acid segment. As used
herein, the term "nucleic acid segment" refers to fragments of a nucleic acid, such as, for a
non-limiting example, those that encode only part of a UGT1 gene locus or a UGT1A1
gene sequence. Thus, a "nucleic acid segment" may comprise any part of a gene
sequence, including from about 2 nucleotides to the full length gene including promoter
regions to the polyadenylation signal and any length that includes all the coding region.

Various nucleic acid segments may be designed based on a particular nucleic acid
sequence, and may be of any length. By assigning numeric values to a sequence, for
example, the first residue is 1, the second residue is 2, etc., an algorithm defining all
nucleic acid segments can be created:

n to n + y

where n is an integer from 1 to the last number of the sequence and y is the length of the
nucleic acid segment minus one, where n + y does not exceed the last number of the
sequence. Thus, for a 10-mer, the nucleic acid segments correspond to bases 1 to 10, 2 to
11, 3 to 12 ... and so on. For a 15-mer, the nucleic acid segments correspond to bases 1 to
15, 2 to 16, 3 to 17 ... and so on. For a 20-mer, the nucleic acid segments correspond to bases
1 to 20, 2 to 21, 3 to 22 ... and so on. In certain embodiments, the nucleic acid segment
may be a probe or primer. As used herein, a "probe" generally refers to a nucleic acid
used in a detection method or composition. As used herein, a "primer" generally refers to
a nucleic acid used in an extension or amplification method or composition.
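The "n to n + y" enumeration above can be sketched in a few lines of Python. The function name and the example sequence are hypothetical and serve only to illustrate the 1-based numbering described in the text.

```python
def enumerate_segments(sequence, segment_length):
    """List every segment of the given length as (n, n + y, bases), numbered from 1."""
    y = segment_length - 1            # y is the segment length minus one
    last = len(sequence)              # the last number of the sequence
    segments = []
    for n in range(1, last - y + 1):  # guarantees that n + y never exceeds `last`
        segments.append((n, n + y, sequence[n - 1:n + y]))
    return segments


# Hypothetical 12-base sequence: its 10-mers are bases 1 to 10, 2 to 11 and 3 to 12.
ten_mers = enumerate_segments("ACGTACGTACGT", 10)
```

A sequence of length L therefore yields L - y segments of a given length, and none when the requested segment is longer than the sequence.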

4. Nucleic Acid Complements

The present invention also encompasses a nucleic acid that is complementary to a
nucleic acid. A nucleic acid is a "complement" or is "complementary" to another
nucleic acid when it is capable of base-pairing with another nucleic acid according to the
standard Watson-Crick, Hoogsteen or reverse Hoogsteen binding complementarity rules.
As used herein "another nucleic acid" may refer to a separate molecule or a spatially
separated sequence of the same molecule. In preferred embodiments, a complement is a
hybridization probe or amplification primer for the detection of a nucleic acid
polymorphism.
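As a minimal illustration of the Watson-Crick case only (Hoogsteen and reverse Hoogsteen pairing are not modeled), the complement of a DNA strand may be computed as sketched below. The function names are hypothetical, and the sketch assumes sequences written 5' to 3' using only the letters A, C, G and T.

```python
# Watson-Crick pairing for DNA: A pairs with T, and G pairs with C.
_WATSON_CRICK = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the complementary strand, written 5' to 3'."""
    return dna.translate(_WATSON_CRICK)[::-1]


def is_fully_complementary(strand_a, strand_b):
    """True when every base of strand_a pairs with strand_b (both written 5' to 3')."""
    return reverse_complement(strand_a).upper() == strand_b.upper()


probe = reverse_complement("ATGC")
```

The strict equality test corresponds to the completely complementary case; partially complementary strands would need a mismatch-tolerant comparison instead.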

As used herein, the term "complementary" or "complement" also refers to a
nucleic acid comprising a sequence of consecutive nucleobases or semiconsecutive
nucleobases (e.g., one or more nucleobase moieties are not present in the molecule)
capable of hybridizing to another nucleic acid strand or duplex even if fewer than all the
nucleobases base pair with a counterpart nucleobase. However, in some
diagnostic or detection embodiments, completely complementary nucleic acids are
preferred.

III. NUCLEIC ACID DETECTION

Some embodiments of the invention concern identifying polymorphisms in
UGT1A1, correlating genotype or haplotype to phenotype, wherein the phenotype is
lowered or altered UGT1A1 activity or expression, and then identifying such
polymorphisms in patients who have been or will be given irinotecan or related drugs or
compounds. Thus, the present invention involves assays for identifying polymorphisms
and other nucleic acid detection methods. Nucleic acids, therefore, have utility as probes
or primers for embodiments involving nucleic acid hybridization. They may be used in
diagnostic or screening methods of the present invention. Detection of nucleic acids
encoding UGT1A1, as well as nucleic acids involved in the expression or stability of
UGT1A1 polypeptides or transcripts, is encompassed by the invention. General
methods of nucleic acid detection are provided below, followed by specific
examples employed for the identification of polymorphisms, including single nucleotide
polymorphisms (SNPs).

A. Hybridization

The use of a probe or primer of between 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, or 15
and 50, 60, 70, 80, 90, or 100 nucleotides, preferably between 17 and 100 nucleotides in
length, or in some aspects of the invention up to 1-2 kilobases or more in length, allows
the formation of a duplex molecule that is both stable and selective. Molecules having
complementary sequences over contiguous stretches greater than 20 bases in length are
generally preferred, to increase stability and/or selectivity of the hybrid molecules
obtained. One will generally prefer to design nucleic acid molecules for hybridization
having one or more complementary sequences of 20 to 30 nucleotides, or even longer
where desired. Such fragments may be readily prepared, for example, by directly
synthesizing the fragment by chemical means or by introducing selected sequences into
recombinant vectors for recombinant production.

Accordingly, the nucleotide sequences of the invention may be used for their
ability to selectively form duplex molecules with complementary stretches of DNAs
and/or RNAs or to provide primers for amplification of DNA or RNA from samples.
Depending on the application envisioned, one would desire to employ varying conditions
of hybridization to achieve varying degrees of selectivity of the probe or primers for the
target sequence.

For applications requiring high selectivity, one will typically desire to employ
relatively high stringency conditions to form the hybrids. For example, one may employ
relatively low salt and/or high temperature conditions, such as those provided by about
0.02 M to about 0.10 M NaCl at temperatures of about 50°C to about 70°C. Such high
stringency conditions tolerate little, if any, mismatch between the probe or primers and
the template or target strand and would be particularly suitable for isolating specific
genes or for detecting a specific polymorphism. It is generally appreciated that conditions
can be rendered more stringent by the addition of increasing amounts of formamide. For
example, under highly stringent conditions, hybridization to filter-bound DNA may be
carried out in 0.5 M NaHPO4, 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 65°C,
and washing in 0.1x SSC/0.1% SDS at 68°C (Ausubel et al., 1989).

Conditions may be rendered less stringent by increasing salt concentration and/or
decreasing temperature. For example, a medium stringency condition could be provided
by about 0.1 to 0.25 M NaCl at temperatures of about 37°C to about 55°C, while a low
stringency condition could be provided by about 0.15 M to about 0.9 M salt, at
temperatures ranging from about 20°C to about 55°C. Under less stringent conditions,
such as moderately stringent conditions, the washing may be carried out, for example, in
0.2x SSC/0.1% SDS at 42°C (Ausubel et al., 1989). Hybridization conditions can be
readily manipulated depending on the desired results.

In other embodiments, hybridization may be achieved under conditions of, for
example, 50 mM Tris-HCl (pH 8.3), 75 mM KCl, 3 mM MgCl2, 1.0 mM dithiothreitol, at
temperatures between approximately 20°C to about 37°C. Other hybridization conditions
utilized could include approximately 10 mM Tris-HCl (pH 8.3), 50 mM KCl, 1.5 mM
MgCl2, at temperatures ranging from approximately 40°C to about 72°C.

In certain embodiments, it will be advantageous to employ nucleic acids of
defined sequences of the present invention in combination with an appropriate means,
such as a label, for determining hybridization. A wide variety of appropriate indicator
means are known in the art, including fluorescent, radioactive, enzymatic or other
ligands, such as avidin/biotin, which are capable of being detected. In preferred
embodiments, one may desire to employ a fluorescent label or an enzyme tag such as
urease, alkaline phosphatase or peroxidase, instead of radioactive or other
environmentally undesirable reagents. In the case of enzyme tags, colorimetric indicator
substrates are known that can be employed to provide a detection means that is visibly or
spectrophotometrically detectable, to identify specific hybridization with complementary
nucleic acid containing samples. In other aspects, a particular nuclease cleavage site may
be present and detection of a particular nucleotide sequence can be determined by the
presence or absence of nucleic acid cleavage.

In general, it is envisioned that the probes or primers described herein will be
useful as reagents in solution hybridization, as in PCR, for detection of expression or
genotype of corresponding genes, as well as in embodiments employing a solid phase. In
embodiments involving a solid phase, the test DNA (or RNA) is adsorbed or otherwise
affixed to a selected matrix or surface. This fixed, single-stranded nucleic acid is then
subjected to hybridization with selected probes under desired conditions. The conditions
selected will depend on the particular circumstances (depending, for example, on the
G+C content, type of target nucleic acid, source of nucleic acid, size of hybridization
probe, etc.). Optimization of hybridization conditions for the particular application of
interest is well known to those of skill in the art. After washing of the hybridized
molecules to remove non-specifically bound probe molecules, hybridization is detected,
and/or quantified, by determining the amount of bound label. Representative solid phase
hybridization methods are disclosed in U.S. Patents 5,843,663, 5,900,481 and 5,919,626.
Other methods of hybridization that may be used in the practice of the present invention
are disclosed in U.S. Patents 5,849,481, 5,849,486 and 5,851,772. The relevant portions
of these and other references identified in this section of the Specification are
incorporated herein by reference.

B. Amplification of Nucleic Acids 

Nucleic acids used as a template for amplification may be isolated from cells,
tissues or other samples according to standard methodologies (Sambrook et al., 2001). In
certain embodiments, analysis is performed on whole cell or tissue homogenates or
biological fluid samples with or without substantial purification of the template nucleic
acid. The nucleic acid may be genomic DNA or fractionated or whole cell RNA. Where
RNA is used, it may be desired to first convert the RNA to a complementary DNA.

The term "primer," as used herein, is meant to encompass any nucleic acid that is
capable of priming the synthesis of a nascent nucleic acid in a template-dependent
process. Typically, primers are oligonucleotides from ten to twenty and/or thirty base
pairs in length, but longer sequences can be employed. Primers may be provided in
double-stranded and/or single-stranded form, although the single-stranded form is
preferred.

Pairs of primers designed to selectively hybridize to nucleic acids corresponding
to the UGT1 gene locus (Genbank accession AF279093), UGT1A1 gene and/or SEQ ID
NO:1 or variants thereof, and fragments thereof are contacted with the template nucleic
acid under conditions that permit selective hybridization. SEQ ID NO:1 sets forth a
nucleotide sequence that includes a majority of the UGT1A1 gene. SEQ ID NO:1
includes nucleotides 169,831 to 187,313 of the UGT1 gene locus with nucleotide 1645 of
SEQ ID NO:1 corresponding to nucleotide -3565 from the transcriptional start of the
UGT1A1 gene, thus the transcriptional start is located at nucleotide 5212 of SEQ ID
NO:1. Depending upon the desired application, high stringency hybridization conditions
may be selected that will only allow hybridization to sequences that are completely
complementary to the primers. In other embodiments, hybridization may occur under
reduced stringency to allow for amplification of nucleic acids that contain one or more
mismatches with the primer sequences. Once hybridized, the template-primer complex is
contacted with one or more enzymes that facilitate template-dependent nucleic acid
synthesis. Multiple rounds of amplification, also referred to as "cycles," are conducted
until a sufficient amount of amplification product is produced.
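Because SEQ ID NO:1 is stated to span nucleotides 169,831 to 187,313 of the UGT1 gene locus, positions can be converted between the two numbering systems with a fixed offset. The following Python sketch shows only that arithmetic; the constant and function names are hypothetical, and the sketch assumes that both numbering systems are 1-based and run in the same direction.

```python
# Locus coordinates of the first and last nucleotides of SEQ ID NO:1, as stated in the text.
LOCUS_START = 169_831
LOCUS_END = 187_313
SEQ_LENGTH = LOCUS_END - LOCUS_START + 1  # number of nucleotides in SEQ ID NO:1


def seq_to_locus(seq_position):
    """Convert a 1-based SEQ ID NO:1 position to a UGT1 locus position."""
    if not 1 <= seq_position <= SEQ_LENGTH:
        raise ValueError("position lies outside SEQ ID NO:1")
    return LOCUS_START + seq_position - 1


def locus_to_seq(locus_position):
    """Convert a UGT1 locus position to a 1-based SEQ ID NO:1 position."""
    if not LOCUS_START <= locus_position <= LOCUS_END:
        raise ValueError("position lies outside the span of SEQ ID NO:1")
    return locus_position - LOCUS_START + 1
```

For example, under these assumptions a primer binding site reported in locus coordinates can be located within SEQ ID NO:1 by calling locus_to_seq on its first and last nucleotides.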

The amplification product may be detected, analyzed or quantified. In certain
applications, the detection may be performed by visual means. In certain applications,
the detection may involve indirect identification of the product via chemiluminescence,
radioactive scintigraphy of incorporated radiolabel or fluorescent label or even via a
system using electrical and/or thermal impulse signals (Affymax technology; Bellus,
1994).

A number of template dependent processes are available to amplify the
oligonucleotide sequences present in a given template sample. One of the best known
amplification methods is the polymerase chain reaction (referred to as PCR™) which is
described in detail in U.S. Patents 4,683,195, 4,683,202 and 4,800,159, and in Innis et al.,
1988, each of which is incorporated herein by reference in its entirety.

Another method for amplification is ligase chain reaction ("LCR"), disclosed in
European Application No. 320 308, incorporated herein by reference in its entirety. U.S.
Patent 4,883,750 describes a method similar to LCR for binding probe pairs to a target
sequence. A method based on PCR™ and oligonucleotide ligase assay (OLA) (described
in further detail below), disclosed in U.S. Patent 5,912,148, may also be used.

Alternative methods for amplification of target nucleic acid sequences that may
be used in the practice of the present invention are disclosed in U.S. Patents 5,843,650,
5,846,709, 5,846,783, 5,849,546, 5,849,497, 5,849,547, 5,858,652, 5,866,366, 5,916,776,
5,922,574, 5,928,905, 5,928,906, 5,932,451, 5,935,825, 5,939,291 and 5,942,391, Great
Britain Application 2 202 328, and in PCT Application PCT/US89/01025, each of which
is incorporated herein by reference in its entirety. Qbeta Replicase, described in PCT
Application PCT/US87/00880, may also be used as an amplification method in the
present invention.

An isothermal amplification method, in which restriction endonucleases and
ligases are used to achieve the amplification of target molecules that contain nucleotide
5'-[alpha-thio]-triphosphates in one strand of a restriction site, may also be useful in the
amplification of nucleic acids in the present invention (Walker et al., 1992). Strand
Displacement Amplification (SDA), disclosed in U.S. Patent 5,916,779, is another
method of carrying out isothermal amplification of nucleic acids which involves multiple
rounds of strand displacement and synthesis, i.e., nick translation.

Other nucleic acid amplification procedures include transcription-based
amplification systems (TAS), including nucleic acid sequence based amplification
(NASBA) and 3SR (Kwoh et al., 1989; PCT Application WO 88/10315, incorporated
herein by reference in their entirety). European Application 329 822 discloses a nucleic
acid amplification process involving cyclically synthesizing single-stranded RNA
("ssRNA"), ssDNA, and double-stranded DNA (dsDNA), which may be used in
accordance with the present invention.

PCT Application WO 89/06700 (incorporated herein by reference in its entirety)
discloses a nucleic acid sequence amplification scheme based on the hybridization of a
promoter region/primer sequence to a target single-stranded DNA ("ssDNA") followed
by transcription of many RNA copies of the sequence. This scheme is not cyclic, i.e.,
new templates are not produced from the resultant RNA transcripts. Other amplification
methods include "RACE" and "one-sided PCR" (Frohman, 1990; Ohara et al., 1989).

C. Detection of Nucleic Acids

Following any amplification, it may be desirable to separate the amplification
product from the template and/or the excess primer. In one embodiment, amplification
products are separated by agarose, agarose-acrylamide or polyacrylamide gel
electrophoresis using standard methods (Sambrook et al., 2001). Separated amplification
products may be cut out and eluted from the gel for further manipulation. Using low
melting point agarose gels, the separated band may be removed by heating the gel,
followed by extraction of the nucleic acid.

Separation of nucleic acids may also be effected by spin columns and/or
chromatographic techniques known in the art. There are many kinds of chromatography
which may be used in the practice of the present invention, including adsorption,
partition, ion-exchange, hydroxylapatite, molecular sieve, reverse-phase, column, paper,
thin-layer, and gas chromatography as well as HPLC.

In certain embodiments, the amplification products are visualized, with or without
separation. A typical visualization method involves staining of a gel with ethidium
bromide and visualization of bands under UV light. Alternatively, if the amplification
products are integrally labeled with radio- or fluorometrically-labeled nucleotides, the
separated amplification products can be exposed to x-ray film or visualized under the
appropriate excitatory spectra.

In one embodiment, following separation of amplification products, a labeled
nucleic acid probe is brought into contact with the amplified marker sequence. The probe
preferably is conjugated to a chromophore but may be radiolabeled. In another
embodiment, the probe is conjugated to a binding partner, such as an antibody or biotin,
or another binding partner carrying a detectable moiety.

In particular embodiments, detection is by Southern blotting and hybridization
with a labeled probe. The techniques involved in Southern blotting are well known to
those of skill in the art (see Sambrook et al., 2001). One example of the foregoing is
described in U.S. Patent 5,279,721, incorporated by reference herein, which discloses an
apparatus and method for the automated electrophoresis and transfer of nucleic acids.
The apparatus permits electrophoresis and blotting without external manipulation of the
gel and is ideally suited to carrying out methods according to the present invention.

Other methods of nucleic acid detection that may be used in the practice of the
instant invention are disclosed in U.S. Patents 5,840,873, 5,843,640, 5,843,651,
5,846,708, 5,846,717, 5,846,726, 5,846,729, 5,849,487, 5,853,990, 5,853,992, 5,853,993,
5,856,092, 5,861,244, 5,863,732, 5,863,753, 5,866,331, 5,905,024, 5,910,407, 5,912,124,
5,912,145, 5,919,630, 5,925,517, 5,928,862, 5,928,869, 5,929,227, 5,932,413 and
5,935,791, each of which is incorporated herein by reference.

D. Other Assays

Other methods for genetic screening may be used within the scope of the present
invention, for example, to detect mutations in genomic DNA, cDNA and/or RNA
samples. Methods used to detect point mutations include denaturing gradient gel
electrophoresis ("DGGE"), restriction fragment length polymorphism analysis ("RFLP"),
chemical or enzymatic cleavage methods, direct sequencing of target regions amplified
by PCR™ (see above), single-strand conformation polymorphism analysis ("SSCP") and
other methods well known in the art.

One method of screening for point mutations is based on RNase cleavage of base
pair mismatches in RNA/DNA or RNA/RNA heteroduplexes. As used herein, the term
"mismatch" is defined as a region of one or more unpaired or mispaired nucleotides in a
double-stranded RNA/RNA, RNA/DNA or DNA/DNA molecule. This definition thus
includes mismatches due to insertion/deletion mutations, as well as single or multiple
base point mutations.

U.S. Patent 4,946,773 describes an RNase A mismatch cleavage assay that
involves annealing single-stranded DNA or RNA test samples to an RNA probe, and
subsequent treatment of the nucleic acid duplexes with RNase A. For the detection of
mismatches, the single-stranded products of the RNase A treatment, electrophoretically
separated according to size, are compared to similarly treated control duplexes. Samples
containing smaller fragments (cleavage products) not seen in the control duplex are
scored as positive.
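The scoring rule just described can be expressed as a short comparison of fragment sizes. The Python sketch below is illustrative only: the function name, the representation of each lane as a list of fragment sizes in nucleotides, and the example sizes are assumptions, and the sketch ignores the sizing tolerance that a real gel would require.

```python
def score_rnase_a_sample(sample_sizes, control_sizes):
    """Return (is_positive, novel_fragments) for one sample lane.

    A sample is scored positive when it contains fragments that are absent
    from the similarly treated control duplex and smaller than the largest
    control fragment. `control_sizes` must not be empty.
    """
    control = set(control_sizes)
    full_length = max(control)
    novel = sorted(
        size for size in set(sample_sizes)
        if size not in control and size < full_length
    )
    return bool(novel), novel


# Hypothetical lanes: the control shows one protected 300-nucleotide fragment.
positive_call = score_rnase_a_sample([300, 180, 120], [300])
negative_call = score_rnase_a_sample([300], [300])
```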

Other investigators have described the use of RNase I in mismatch assays. The
use of RNase I for mismatch detection is described in literature from Promega Biotech.
Promega markets a kit containing RNase I that is reported to cleave three out of four
known mismatches. Others have described using the MutS protein or other DNA-repair
enzymes for detection of single-base mismatches.

Alternative methods for detection of deletion, insertion or substitution mutations 
that may be used in the practice of the present invention are disclosed in U.S. Patents 
5,849,483, 5,851,770, 5,866,337, 5,925,525 and 5,928,870, each of which is incorporated 
herein by reference in its entirety. 

E. Specific Examples of SNP Screening Methods

Spontaneous mutations that arise during the course of evolution in the genomes of
organisms are often not immediately transmitted throughout all of the members of the
species, thereby creating polymorphic alleles that co-exist in the species populations.
Often polymorphisms are the cause of genetic diseases. Several classes of
polymorphisms have been identified. For example, variable nucleotide type
polymorphisms (VNTRs) arise from spontaneous tandem duplications of di- or
trinucleotide repeated motifs of nucleotides. If such variations alter the lengths of DNA
fragments generated by restriction endonuclease cleavage, the variations are referred to as
restriction fragment length polymorphisms (RFLPs). RFLPs have been widely used in
human and animal genetic analyses.

Another class of polymorphisms is generated by the replacement of a single
nucleotide. Such single nucleotide polymorphisms (SNPs) rarely result in changes in a
restriction endonuclease site. Thus, SNPs are rarely detectable by restriction fragment
length analysis. SNPs are the most common genetic variations, occurring once every 100
to 300 bases, and several SNP mutations have been found that affect a single nucleotide in
a protein-encoding gene in a manner sufficient to actually cause a genetic disease. SNP
diseases are exemplified by hemophilia, sickle-cell anemia, hereditary hemochromatosis,
late-onset Alzheimer disease, etc.

In the context of the present invention, polymorphic mutations that affect the activity
and/or levels of the UGT1A1 gene products, which are responsible for the
glucuronidation of irinotecan and other chemotherapeutic and xenobiotic agents, will be
determined by a series of screening methods. One set of screening methods is aimed at
identifying SNPs that affect the inducibility, activity and/or level of the UGT1A1 gene
products in in vitro or in vivo assays. The other set of screening methods will then be
performed to screen an individual for the occurrence of the SNPs identified above. To do
this, a sample (such as blood or other bodily fluid or tissue sample) will be taken from a
patient for genotype analysis. The presence or absence of SNPs will determine the ability
of the screened individuals to metabolize irinotecan and other chemotherapeutic agents
that are metabolized by the UGT1A1 gene products. According to methods provided by
the invention, these results will be used to adjust and/or alter the dose of irinotecan or
other agent administered to an individual in order to reduce drug side effects.

SNPs can be the result of deletions, point mutations and insertions; in general,
any single base alteration, whatever the cause, can result in a SNP. The greater frequency
of SNPs means that they can be more readily identified than the other classes of
polymorphisms. The greater uniformity of their distribution permits the identification of
SNPs "nearer" to a particular trait of interest. The combined effect of these two attributes
makes SNPs extremely valuable. For example, if a particular trait (e.g., inability to
efficiently metabolize irinotecan) reflects a mutation at a particular locus, then any
polymorphism that is linked to the particular locus can be used to predict the probability
that an individual will exhibit that trait.

Several methods have been developed to screen polymorphisms and some
examples are listed below. The references of Kwok and Chen (2003) and Kwok (2001)
provide overviews of some of these methods; both of these references are specifically
incorporated by reference.

SNPs relating to glucuronidation of chemotherapeutic agents can be characterized
by the use of any of these methods or suitable modification thereof. Such methods
include the direct or indirect sequencing of the site, the use of restriction enzymes where
the respective alleles of the site create or destroy a restriction site, the use of
allele-specific hybridization probes, the use of antibodies that are specific for the proteins
encoded by the different alleles of the polymorphism, or any other biochemical
interpretation.

i) DNA Sequencing

The most commonly used method of characterizing a polymorphism is direct
DNA sequencing of the genetic locus that flanks and includes the polymorphism. Such
analysis can be accomplished using either the "dideoxy-mediated chain termination
method," also known as the "Sanger Method" (Sanger, F., et al., 1975) or the "chemical
degradation method," also known as the "Maxam-Gilbert method" (Maxam, A. M., et al.,
1977). Sequencing in combination with genomic sequence-specific amplification
technologies, such as the polymerase chain reaction, may be utilized to facilitate the
recovery of the desired genes (Mullis, K. et al., 1986; European Patent Application
50,424; European Patent Application 84,796; European Patent Application 258,017;
European Patent Application 237,362; European Patent Application 201,184; U.S.
Patents 4,683,202; 4,582,788; and 4,683,194), all of the above incorporated herein by
reference.

ii) Exonuclease Resistance 

Other methods that can be employed to determine the identity of a nucleotide
present at a polymorphic site utilize a specialized exonuclease-resistant nucleotide
derivative (U.S. Patent 4,656,127). A primer complementary to an allelic sequence
immediately 3' to the polymorphic site is hybridized to the DNA under investigation. If
the polymorphic site on the DNA contains a nucleotide that is complementary to the
particular exonuclease-resistant nucleotide derivative present, then that derivative will
be incorporated by a polymerase onto the end of the hybridized primer. Such
incorporation makes the primer resistant to exonuclease cleavage and thereby permits its
detection. As the identity of the exonuclease-resistant derivative is known, one can
determine the specific nucleotide present in the polymorphic site of the DNA.

iii) Microsequencing Methods 

Several other primer-guided nucleotide incorporation procedures for assaying
polymorphic sites in DNA have been described (Kornher, J. S. et al., 1989; Sokolov, B.
P., 1990; Syvanen 1990; Kuppuswamy et al., 1991; Prezant et al., 1992; Ugozzoli, L. et
al., 1992; Nyren et al., 1993). These methods rely on the incorporation of labeled
deoxynucleotides to discriminate between bases at a polymorphic site. As the signal is
proportional to the number of deoxynucleotides incorporated, polymorphisms that occur
in runs of the same nucleotide result in a signal that is proportional to the length of the
run (Syvanen et al., 1990).

iv) Extension in Solution 

French Patent 2,650,840 and PCT Application WO91/02087 discuss a
solution-based method for determining the identity of the nucleotide of a polymorphic site.
According to these methods, a primer complementary to allelic sequences immediately
3' to a polymorphic site is used. The identity of the nucleotide of that site is determined
using labeled dideoxynucleotide derivatives which are incorporated at the end of the
primer if complementary to the nucleotide of the polymorphic site.
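
The primer-extension logic described above can be illustrated with a short Python sketch that predicts which labeled dideoxynucleotide would be incorporated for a given template allele. The sequences and function names are hypothetical and are not taken from the cited patent documents; the sketch assumes a single, perfectly matched primer binding site on the template strand.

```python
# Hypothetical sketch of single-base primer extension (illustrative names and sequences).
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def incorporated_terminator(template, primer):
    """Return the ddNTP base added to the 3' end of the hybridized primer.

    The primer anneals to the template segment immediately 3' of the
    polymorphic site, so the first base added pairs with the polymorphic base.
    """
    binding_site = reverse_complement(primer)
    start = template.find(binding_site)
    if start < 1:
        raise ValueError("primer does not anneal immediately 3' of a template base")
    polymorphic_base = template[start - 1]
    return COMPLEMENT[polymorphic_base]


primer = "GCAAGC"  # anneals to GCTTGC on the template (assumed example)
print(incorporated_terminator("GATTACAGCTTGCA", primer))  # allele with A at the site
print(incorporated_terminator("GATTACGGCTTGCA", primer))  # allele with G at the site
```

Because only the terminator complementary to the polymorphic base is incorporated, the identity of the label reports the allele.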

v) Genetic Bit Analysis or Solid-Phase Extension 

PCT Application WO92/15712 describes a method that uses mixtures of labeled
terminators and a primer that is complementary to the sequence 3' to a polymorphic site.
The labeled terminator that is incorporated is complementary to the nucleotide present in
the polymorphic site of the target molecule being evaluated and is thus identified. Here
the primer or the target molecule is immobilized to a solid phase.

vi) Oligonucleotide Ligation Assay (OLA)

This is another solid phase method that uses different methodology (Landegren et
al., 1988). Two oligonucleotides, capable of hybridizing to abutting sequences of a
single strand of a target DNA, are used. One of these oligonucleotides is biotinylated
while the other is detectably labeled. If the precise complementary sequence is found in a
target molecule, the oligonucleotides will hybridize such that their termini abut, and
create a ligation substrate. Ligation permits the recovery of the labeled oligonucleotide
by using avidin. Other nucleic acid detection assays, based on this method, combined
with PCR have also been described (Nickerson et al., 1990). Here PCR is used to
achieve the exponential amplification of target DNA, which is then detected using the
OLA.

vii) Ligase/Polymerase-Mediated Genetic Bit Analysis

U.S. Patent 5,952,174 describes a method that also involves two primers capable
of hybridizing to abutting sequences of a target molecule. The hybridized product is
formed on a solid support to which the target is immobilized. Here the hybridization
occurs such that the primers are separated from one another by a space of a single
nucleotide. Incubating this hybridized product in the presence of a polymerase, a ligase,
and a nucleoside triphosphate mixture containing at least one deoxynucleoside
triphosphate allows the ligation of any pair of abutting hybridized oligonucleotides.
Addition of a ligase results in two events required to generate a signal, extension and
ligation. This provides a higher specificity and lower "noise" than methods using either
extension or ligation alone. Unlike the polymerase-based assays, this method enhances
the specificity of the polymerase step by combining it with a second hybridization and a
ligation step for a signal to be attached to the solid phase.

viii) Other Methods To Detect SNPs 

Several other specific methods for SNP detection and identification are presented
below and may be used as such or with suitable modifications in conjunction with
identifying polymorphisms of the UGT1A1 genes in the present invention. Several other
methods are also described on the SNP web site of the NCBI at the website
www.ncbi.nlm.nih.gov/SNP, incorporated herein by reference.

In a particular embodiment, extended haplotypes may be determined at any given
locus in a population, which allows one to identify exactly which SNPs will be redundant
and which will be essential in association studies. The latter are referred to as 'haplotype
tag SNPs (htSNPs)', markers that capture the haplotypes of a gene or a region of linkage
disequilibrium. See Johnson et al. (2001) and Ke and Cardon (2003), each of which is
incorporated herein by reference, for exemplary methods.
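
The notion of redundant versus essential SNPs can be illustrated with a deliberately simplified Python sketch. It treats two SNPs as redundant only when their alleles partition the observed haplotypes identically, and keeps one representative per group. This is an assumption made for illustration; it is not the method of Johnson et al. (2001) or Ke and Cardon (2003), and the haplotypes shown are invented.

```python
# Toy sketch: SNPs with identical allele patterns across haplotypes are mutually redundant.
def allele_pattern(haplotypes, snp_index):
    """Encode one SNP column as integers numbered by order of first appearance."""
    codes = {}
    return tuple(codes.setdefault(h[snp_index], len(codes)) for h in haplotypes)


def tag_snps(haplotypes):
    """Return one representative SNP index for each group of redundant SNPs."""
    groups = {}
    for i in range(len(haplotypes[0])):
        groups.setdefault(allele_pattern(haplotypes, i), []).append(i)
    return [members[0] for members in groups.values()]


# Four invented haplotypes over three SNP positions.
haplotypes = ["ACG", "ACT", "GTG", "GTT"]
print(tag_snps(haplotypes))  # SNPs 0 and 1 carry the same information; SNP 2 does not
```

In this toy data set, genotyping the first and third SNPs is enough to distinguish all four haplotypes.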

The VDA-assay utilizes PCR amplification of genomic segments by long PCR
methods using TaKaRa LA Taq reagents and other standard reaction conditions. The
long amplification can amplify DNA sizes of about 2,000-12,000 bp. Hybridization of
products to a variant detector array (VDA) can be performed by an Affymetrix High
Throughput Screening Center and analyzed with computerized software.

A method called Chip Assay uses PCR amplification of genomic segments by
standard or long PCR protocols. Hybridization products are analyzed by VDA (Halushka
et al., 1999, incorporated herein by reference). SNPs are generally classified as "Certain"
or "Likely" based on computer analysis of hybridization patterns. By comparison to
alternative detection methods such as nucleotide sequencing, "Certain" SNPs have been
confirmed 100% of the time; and "Likely" SNPs have been confirmed 73% of the time by
this method.

Other methods simply involve PCR amplification following digestion with the
relevant restriction enzyme. Yet others involve sequencing of purified PCR products
from known genomic regions.

In yet another method, individual exons or overlapping fragments of large exons
are PCR-amplified. Primers are designed from published or database sequences and
PCR-amplification of genomic DNA is performed using the following conditions: 200 ng
DNA template, 0.5 µM each primer, 80 µM each of dCTP, dATP, dTTP and dGTP, 5%
formamide, 1.5 mM MgCl2, 0.5 U of Taq polymerase and 0.1 volume of the Taq buffer.
Thermal cycling is performed and resulting PCR-products are analyzed by PCR-single
strand conformation polymorphism (PCR-SSCP) analysis, under a variety of conditions,
e.g., 5 or 10% polyacrylamide gel with 15% urea, with or without 5% glycerol.
Electrophoresis is performed overnight. PCR-products that show mobility shifts are
reamplified and sequenced to identify nucleotide variation.

In a method called CGAP-GAI (DEMIGLACE), sequence and alignment data
(from a PHRAP .ace file), quality scores for the sequence base calls (from PHRED quality
files), distance information (from the PHYLIP dnadist and neighbour programs) and
base-calling data (from the PHRED '-d' switch) are loaded into memory. Sequences are
aligned and each vertical chunk ('slice') of the resulting assembly is examined for
disagreement. Any such slice is considered a candidate SNP (DEMIGLACE). A number of
filters are used by DEMIGLACE to eliminate slices that are not likely to represent true
polymorphisms. These include filters that: (i) exclude sequences in any given slice from
SNP consideration where neighboring sequence quality scores drop 40% or more; (ii)
exclude calls in which peak amplitude is below the fifteenth percentile of all base calls
for that nucleotide type; (iii) disqualify regions of a sequence having a high number of
disagreements with the consensus from participating in SNP calculations; (iv) remove
from consideration any base call with an alternative call in which the peak takes up 25%
or more of the area of the called peak; and (v) exclude variations that occur in only one read
direction. PHRED quality scores are converted into probability-of-error values for each
nucleotide in the slice. Standard Bayesian methods are used to calculate the posterior
probability that there is evidence of nucleotide heterogeneity at a given location.
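
The last two steps can be sketched in Python under stated assumptions. The conversion uses the standard PHRED definition, in which a quality score Q corresponds to an error probability of 10^(-Q/10). The posterior calculation is a toy two-hypothesis model (all reads from one allele versus reads drawn equally from two alleles) with an arbitrary prior; it is offered only as an illustration and is not the actual DEMIGLACE computation.

```python
from collections import Counter


def phred_to_error_prob(q):
    """Standard PHRED relation: Q = -10 * log10(P_error)."""
    return 10 ** (-q / 10)


def posterior_heterogeneity(calls, quals, prior=0.001):
    """Toy posterior that a slice is polymorphic (prior value is an assumption).

    H0: every read comes from the major allele; mismatches are base-call errors.
    H1: each read comes from the major or minor allele with equal probability.
    """
    ranked = Counter(calls).most_common()
    major = ranked[0][0]
    minor = ranked[1][0] if len(ranked) > 1 else None
    like_h0 = 1.0
    like_h1 = 1.0
    for base, q in zip(calls, quals):
        e = phred_to_error_prob(q)
        p_major = 1 - e if base == major else e / 3
        p_minor = 1 - e if base == minor else e / 3
        like_h0 *= p_major
        like_h1 *= 0.5 * p_major + 0.5 * p_minor
    return prior * like_h1 / (prior * like_h1 + (1 - prior) * like_h0)


print(phred_to_error_prob(20))                    # quality 20 -> 1 error in 100
print(posterior_heterogeneity("AAGG", [40] * 4))  # two well-supported alleles
print(posterior_heterogeneity("AAAA", [40] * 4))  # no disagreement in the slice
```

With high-quality calls, a slice showing two alleles in two reads each receives a posterior close to 1, whereas a slice with no disagreement stays well below the prior.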

In a method called CU-RDF (RESEQ), PCR amplification is performed from
DNA isolated from blood using specific primers for each SNP and, after typical cleanup
protocols to remove unused primers and free nucleotides, direct sequencing is performed
using the same or nested primers.

In a method called DEBNICK (METHOD-B), a comparative analysis of clustered
EST sequences is performed and confirmed by fluorescent-based DNA sequencing. In a
related method, called DEBNICK (METHOD-C), comparative analysis of clustered EST
sequences (with phred quality > 20 at the site of the mismatch, average phred quality >=
20 over 5 bases 5' and 3' to the SNP, no mismatches in 5 bases 5' and 3' to the
SNP, and at least two occurrences of each allele) is performed and confirmed by examining
traces.
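
The METHOD-C acceptance criteria can be expressed as a small filter. The Python sketch below is one possible reading of those criteria: the flanking average is taken over the ten flanking bases together, and a candidate is reported only when at least two alleles are each seen in two or more qualifying reads. The data structure, names and example reads are hypothetical.

```python
from collections import Counter
from dataclasses import dataclass

FLANK = 5  # bases examined on each side of the candidate site


@dataclass
class AlignedRead:
    bases: str   # read bases aligned column-for-column with the reference
    quals: list  # phred quality for each base


def read_qualifies(read, reference, pos):
    """Apply the per-read quality and flanking-mismatch criteria at one site."""
    lo, hi = pos - FLANK, pos + FLANK + 1
    if lo < 0 or hi > len(read.bases) or hi > len(reference):
        return False
    if read.quals[pos] <= 20:  # phred quality > 20 at the mismatch site
        return False
    flank = [i for i in range(lo, hi) if i != pos]
    if sum(read.quals[i] for i in flank) / len(flank) < 20:  # average >= 20
        return False
    return all(read.bases[i] == reference[i] for i in flank)  # clean flanks


def candidate_snp_alleles(reads, reference, pos):
    """Return the sorted alleles if at least two alleles each occur twice or more."""
    counts = Counter(r.bases[pos] for r in reads if read_qualifies(r, reference, pos))
    alleles = sorted(base for base, n in counts.items() if n >= 2)
    return alleles if len(alleles) >= 2 else []


ref = "ACGTACGTACGTA"
alt = "ACGTACTTACGTA"  # differs from ref only at position 6
quals = [30] * len(ref)
reads = [AlignedRead(ref, quals), AlignedRead(ref, quals),
         AlignedRead(alt, quals), AlignedRead(alt, quals)]
print(candidate_snp_alleles(reads, ref, 6))
```

Candidates passing such a filter would still be confirmed by examining the traces, as stated above.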

In a method identified by ERO (RESEQ), new primer sets are designed for
electronically published STSs and used to amplify DNA from 10 different mouse strains.
The amplification product from each strain is then gel purified and sequenced using a
standard dideoxy, cycle sequencing technique with 33P-labeled terminators. All the
ddATP terminated reactions are then loaded in adjacent lanes of a sequencing gel
followed by all of the ddGTP reactions and so on. SNPs are identified by visually
scanning the radiographs.

In another method identified as ERO (RESEQ-HT), new primer sets are designed
for electronically published murine DNA sequences and used to amplify DNA from 10
different mouse strains. The amplification product from each strain is prepared for
sequencing by treating with Exonuclease I and Shrimp Alkaline Phosphatase.
Sequencing is performed using ABI Prism Big Dye Terminator Ready Reaction Kit
(Perkin-Elmer) and sequence samples are run on the 3700 DNA Analyzer (96 Capillary
Sequencer).

FGU-CBT (SCA2-SNP) identifies a method where the region containing the SNP
is PCR amplified using the primers SCA2-FP3 and SCA2-RP3. Approximately 100 ng
of genomic DNA is amplified in a 50 µl reaction volume containing a final concentration
of 5 mM Tris, 25 mM KCl, 0.75 mM MgCl2, 0.05% gelatin, 20 pmol of each primer and
0.5 U of Taq DNA polymerase. Samples are denatured, annealed and extended and the
PCR product is purified from a band cut out of the agarose gel using, for example, the
QIAquick gel extraction kit (Qiagen) and is sequenced using dye terminator chemistry on
an ABI Prism 377 automated DNA sequencer with the PCR primers.

In a method identified as JBLACK (SEQ/RESTRICT), two independent PCR
reactions are performed with genomic DNA. Products from the first reaction are
analyzed by sequencing, indicating a unique FspI restriction site. The mutation is
confirmed in the product of the second PCR reaction by digesting with FspI.
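
Whether a single-base change creates or destroys a restriction site, as exploited in the SEQ/RESTRICT approach, can be checked computationally. The Python sketch below scans for the FspI recognition sequence (TGCGCA, which is palindromic, so one strand suffices). The example amplicon sequence and the function names are invented for illustration and do not come from the JBLACK submission.

```python
FSPI_SITE = "TGCGCA"  # FspI recognition sequence (palindromic)


def site_positions(seq, site=FSPI_SITE):
    """Return every 0-based start index at which the recognition site occurs."""
    return [i for i in range(len(seq) - len(site) + 1) if seq[i:i + len(site)] == site]


def classify_snp(ref_seq, pos, alt_base, site=FSPI_SITE):
    """Report whether substituting alt_base at pos creates or destroys a site.

    If a change both creates and destroys sites, creation is reported first.
    """
    alt_seq = ref_seq[:pos] + alt_base + ref_seq[pos + 1:]
    ref_sites = set(site_positions(ref_seq, site))
    alt_sites = set(site_positions(alt_seq, site))
    if alt_sites - ref_sites:
        return "creates site"
    if ref_sites - alt_sites:
        return "destroys site"
    return "no change"


amplicon = "GGATGCTCATT"  # invented reference allele with no FspI site
print(classify_snp(amplicon, 6, "G"))       # T->G yields GGATGCGCATT
print(classify_snp("GGATGCGCATT", 6, "T"))  # the reverse change removes the site
```

An allele that gains or loses the site would then be expected to give a different digestion pattern from the other allele.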

In a method described as KWOK(1), SNPs are identified by comparing high
quality genomic sequence data from four randomly chosen individuals by direct DNA
sequencing of PCR products with dye-terminator chemistry (see Kwok et al., 1996). In a
related method identified as KWOK(2), SNPs are identified by comparing high quality
genomic sequence data from overlapping large-insert clones such as bacterial artificial
chromosomes (BACs) or P1-based artificial chromosomes (PACs). An STS containing
this SNP is then developed and the existence of the SNP in various populations is
confirmed by pooled DNA sequencing (see Taillon-Miller et al., 1998). In another
similar method called KWOK(3), SNPs are identified by comparing high quality genomic
sequence data from overlapping large-insert clones (BACs or PACs). The SNPs found by
this approach represent DNA sequence variations between the two donor chromosomes
but the allele frequencies in the general population have not yet been determined. In
method KWOK(5), SNPs are identified by comparing high quality genomic sequence
data from a homozygous DNA sample and one or more pooled DNA samples by direct
DNA sequencing of PCR products with dye-terminator chemistry. The STSs used are
developed from sequence data found in publicly available databases. Specifically, these
STSs are amplified by PCR against a complete hydatidiform mole (CHM) that has been
shown to be homozygous at all loci and a pool of DNA samples from 80 CEPH parents
(see Kwok et al., 1994).

In another such method, KWOK (OverlapSnpDetectionWithPolyBayes), SNPs
are discovered by automated computer analysis of overlapping regions of large-insert
human genomic clone sequences. For data acquisition, clone sequences are obtained
directly from large-scale sequencing centers. This is necessary because base quality
sequences are not present/available through GenBank. Raw data processing involves
analysis of clone sequences and accompanying base quality information for consistency.
Finished ('base perfect', error rate lower than 1 in 10,000 bp) sequences with no
associated base quality sequences are assigned a uniform base quality value of 40 (1 in
10,000 bp error rate). Draft sequences without base quality values are rejected.
Processed sequences are entered into a local database. A version of each sequence with
known human repeats masked is also stored. Repeat masking is performed with the
program "MASKERAID." Overlap detection: Putative overlaps are detected with the
program "WUBLAST." Several filtering steps follow in order to eliminate false
overlap detection results, i.e., similarities between a pair of clone sequences that arise due
to sequence duplication as opposed to true overlap. The filters consider the total length
of overlap, the overall percent similarity, and the number of sequence differences between
nucleotides with high base quality values ("high-quality mismatches"). Results are also
compared to results of restriction fragment mapping of genomic clones at Washington
University Genome Sequencing Center, finisher's reports on overlaps, and results of the
sequence contig building effort at the NCBI. SNP detection: Overlapping pairs of clone
sequences are analyzed for candidate SNP sites with the 'POLYBAYES' SNP detection
software. Sequence differences between the pair of sequences are scored for the
probability of representing true sequence variation as opposed to sequencing error. This
process requires the presence of base quality values for both sequences. High-scoring
candidates are extracted. The search is restricted to substitution-type single base pair
variations. The confidence score of each candidate SNP is computed by the POLYBAYES
software.

In a method identified as KWOK (TaqMan assay), the TaqMan assay is used to
determine genotypes for 90 random individuals. In a method identified as KYUGEN(Q1),
DNA samples of indicated populations are pooled and analyzed by PLACE-SSCP. Peak
heights of each allele in the pooled analysis are corrected by those in a heterozygote, and
are subsequently used for calculation of allele frequencies. Allele frequencies higher
than 10% are reliably quantified by this method. Allele frequency = 0 (zero) means that
the allele was found among individuals, but the corresponding peak is not seen in the
examination of the pool. Allele frequency = 0-0.1 indicates that minor alleles are detected in
the pool but the peaks are too low to reliably quantify.
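
The heterozygote correction can be sketched as follows. The source states only that pooled peak heights are corrected by those in a heterozygote; the specific formula below is an assumption. It relies on the fact that a heterozygote carries the two alleles in a 1:1 ratio, so any departure of its peak-height ratio from 1 reflects signal bias that can be divided out of the pooled ratio. Function and parameter names are hypothetical.

```python
def pooled_allele_frequency(pool_a, pool_b, het_a, het_b):
    """Estimate the frequency of allele A in a pool from peak heights.

    pool_a, pool_b: peak heights of alleles A and B in the pooled sample.
    het_a, het_b:   peak heights of A and B in a heterozygous individual,
                    where the true allele ratio is 1:1.
    """
    if pool_b == 0:
        return 1.0  # only allele A detected in the pool
    bias = het_a / het_b                      # signal bias of A relative to B
    corrected_ratio = (pool_a / pool_b) / bias
    return corrected_ratio / (1 + corrected_ratio)


# Invented peak heights: allele A gives twice the signal of B per copy.
print(pooled_allele_frequency(pool_a=300, pool_b=50, het_a=100, het_b=50))
```

Without the correction, the raw peak heights in this invented example would suggest a frequency of about 0.86 for allele A rather than 0.75.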
In yet another method identified as KYUGEN (Method1), PCR products are
post-labeled with fluorescent dyes and analyzed by an automated capillary electrophoresis
system under SSCP conditions (PLACE-SSCP). Four or more individual DNAs are
analyzed with or without two pooled DNAs (Japanese pool and CEPH parents pool) in a
series of experiments. Alleles are identified by visual inspection. Individual DNAs with
different genotypes are sequenced and SNPs identified. Allele frequencies are estimated
from peak heights in the pooled samples after correction of signal bias using peak heights
in heterozygotes. For the PCR, primers are tagged to have 5'-ATT or 5'-GTT at their ends
for post-labeling of both strands. Samples of DNA (10 ng/µl) are amplified in reaction
mixtures containing the buffer (10 mM Tris-HCl, pH 8.3 or 9.3, 50 mM KCl, 2.0 mM
MgCl2), 0.25 nM of each primer, 200 µM of each dNTP, and 0.025 units/µl of Taq DNA
polymerase premixed with anti-Taq antibody. The two strands of PCR products are
differentially labeled with nucleotides modified with R110 and R6G by an exchange
reaction of Klenow fragment of DNA polymerase I. The reaction is stopped by adding
EDTA, and unincorporated nucleotides are dephosphorylated by adding calf intestinal
alkaline phosphatase. For the SSCP, an aliquot of fluorescently labeled PCR products
and TAMRA-labeled internal markers are added to deionized formamide, and denatured.
Electrophoresis is performed in a capillary using an ABI Prism 310 Genetic Analyzer.
GeneScan software (P-E Biosystems) is used for data collection and data processing.
DNA of individuals (two to eleven), including those who showed different genotypes on
SSCP, is subjected to direct sequencing using big-dye terminator chemistry on ABI
Prism 310 sequencers. Multiple sequence trace files obtained from ABI Prism 310 are
processed and aligned by Phred/Phrap and viewed using Consed viewer. SNPs are
identified by PolyPhred software and visual inspection.

In yet another method identified as KYUGEN (Method2), individuals with
different genotypes are searched by denaturing HPLC (DHPLC) or PLACE-SSCP
(Inazuka et al., 1997) and their sequences are determined to identify SNPs. PCR is
performed with primers tagged with 5'-ATT or 5'-GTT at their ends for post-labeling of
both strands. DHPLC analysis is carried out using the WAVE DNA fragment analysis
system (Transgenomic). PCR products are injected into a DNASep column, and separated
under the conditions determined using the WAVEMaker program (Transgenomic). The two
strands of PCR products are differentially labeled with nucleotides modified with
R110 and R6G by an exchange reaction of Klenow fragment of DNA polymerase I. The
reaction is stopped by adding EDTA, and unincorporated nucleotides are
dephosphorylated by adding calf intestinal alkaline phosphatase. SSCP followed by
electrophoresis is performed in a capillary using an ABI Prism 310 Genetic Analyzer
with GeneScan software (P-E Biosystems). DNA of individuals, including those who showed
different genotypes on DHPLC or SSCP, is subjected to direct sequencing using
big-dye terminator chemistry on an ABI Prism 310 sequencer. Multiple sequence trace files
obtained from ABI Prism 310 are processed and aligned by Phred/Phrap and viewed
using Consed viewer. SNPs are identified by PolyPhred software and visual inspection.

Trace chromatogram data of EST sequences in Unigene are processed with PHRED. To
identify likely SNPs, single base mismatches are reported from multiple sequence
alignments produced by the programs PHRAP, BRO and POA for each Unigene cluster.
BRO corrects possible misreported EST orientations, while POA identifies and
analyzes non-linear alignment structures indicative of gene mixing/chimeras that might
produce spurious SNPs. Bayesian inference is used to weigh evidence for true
polymorphism versus sequencing error, misalignment or ambiguity, misclustering or
chimeric EST sequences, assessing data such as raw chromatogram height, sharpness,
overlap and spacing; sequencing error rates; context-sensitivity; cDNA library origin, etc.

In a method identified as MARSHFIELD (Method-B), overlapping human DNA
sequences that contain putative insertion/deletion polymorphisms are identified
through searches of public databases. PCR primers that flank each polymorphic site
are selected from the consensus sequences. Primers are used to amplify individual or
pooled human genomic DNA. Resulting PCR products are resolved on a denaturing
polyacrylamide gel and a PhosphorImager is used to estimate allele frequencies from
DNA pools.

IV. PHARMACEUTICAL COMPOSITIONS

Aqueous compositions may have an effective amount of irinotecan and/or an
effective amount of a compound (second agent) that increases conjugative enzyme
activity, as represented by a compound that increases the activity of the phase II
conjugative enzyme, glucuronosyltransferase, or that decreases biliary transport. Such
compositions will generally be dissolved or dispersed in a pharmaceutically acceptable
carrier or aqueous medium.

The phrases "pharmaceutically or pharmacologically acceptable" refer to
molecular entities and compositions that do not produce an adverse, allergic or other
untoward reaction when administered to an animal, or human, as appropriate. As used
herein, "pharmaceutically acceptable carrier" includes any and all solvents, dispersion
media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying
agents and the like. The use of such media and agents for pharmaceutically active
substances is well known in the art. Except insofar as any conventional media or agent is
incompatible with the active ingredients, its use in the therapeutic compositions is
contemplated. Supplementary active ingredients, such as other anti-cancer agents, can
also be incorporated into the compositions.

In addition to the compounds formulated for parenteral administration, such as
intravenous or intramuscular injection, other pharmaceutically acceptable forms include,
e.g., tablets or other solids for oral administration; time release capsules; and any other
form currently used, including creams, lotions, mouthwashes, inhalants and the like.
A. Parenteral Administration 

The active compounds will often be formulated for parenteral administration, e.g.,
formulated for injection via the intravenous, intramuscular, sub-cutaneous, or even
intraperitoneal routes. The preparation of an aqueous composition that contains
irinotecan and a second agent as active ingredients will be known to those of skill in the
art in light of the present disclosure. Typically, such compositions can be prepared as
injectables, either as liquid solutions or suspensions; solid forms suitable for use in
preparing solutions or suspensions upon the addition of a liquid prior to injection can also
be prepared; and the preparations can also be emulsified.

Solutions of the active compounds as free base or pharmacologically acceptable
salts can be prepared in water suitably mixed with a surfactant, such as
hydroxypropylcellulose. Dispersions can also be prepared in glycerol, liquid
polyethylene glycols, and mixtures thereof and in oils. Under ordinary conditions of
storage and use, these preparations contain a preservative to prevent the growth of
microorganisms.

The pharmaceutical forms suitable for injectable use include sterile aqueous
solutions or dispersions; formulations including sesame oil, peanut oil or aqueous
propylene glycol; and sterile powders for the extemporaneous preparation of sterile
injectable solutions or dispersions. In all cases the form must be sterile and must be fluid
to the extent that easy syringability exists. It must be stable under the conditions of
manufacture and storage and must be preserved against the contaminating action of
microorganisms, such as bacteria and fungi.

The active compounds may be formulated into a composition in a neutral or salt
form. Pharmaceutically acceptable salts include the acid addition salts (formed with the
free amino groups of the protein), which are formed with inorganic acids such as, for
example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic,
tartaric, mandelic, and the like. Salts formed with the free carboxyl groups can also be
derived from inorganic bases such as, for example, sodium, potassium, ammonium,
calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine,
histidine, procaine and the like.

The carrier can also be a solvent or dispersion medium containing, for example,
water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid polyethylene
glycol, and the like), suitable mixtures thereof, and vegetable oils. The proper fluidity
can be maintained, for example, by the use of a coating, such as lecithin, by the
maintenance of the required particle size in the case of dispersion and by the use of
surfactants. The prevention of the action of microorganisms can be brought about by
various antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol,
sorbic acid, thimerosal, and the like. In many cases, it will be preferable to include
isotonic agents, for example, sugars or sodium chloride. Prolonged absorption of the
injectable compositions can be brought about by the use in the compositions of agents
delaying absorption, for example, aluminum monostearate and gelatin.

Sterile injectable solutions are prepared by incorporating the active compounds in
the required amount in the appropriate solvent with various of the other ingredients
enumerated above, as required, followed by filter sterilization. Generally, dispersions
are prepared by incorporating the various sterilized active ingredients into a sterile
vehicle which contains the basic dispersion medium and the required other ingredients
from those enumerated above. In the case of sterile powders for the preparation of sterile
injectable solutions, the preferred methods of preparation are vacuum-drying and
freeze-drying techniques which yield a powder of the active ingredient plus any additional
desired ingredient from a previously sterile-filtered solution thereof.

Upon formulation, solutions will be administered in a manner compatible with the
dosage formulation and in such amount as is therapeutically effective. The formulations
are easily administered in a variety of dosage forms, such as the type of injectable
solutions described above, with even drug release capsules and the like being
employable.

For parenteral administration in an aqueous solution, for example, the solution 
should be suitably buffered if necessary and the liquid diluent first rendered isotonic with 
sufficient saline or glucose. These particular aqueous solutions are especially suitable for
intravenous, intramuscular, subcutaneous and intraperitoneal administration. In this
connection, sterile aqueous media which can be employed will be known to those of skill 
in the art in light of the present disclosure. For example, one dosage could be dissolved 
in 1 mL of isotonic NaCl solution and either added to 1000 mL of hypodermoclysis fluid
or injected at the proposed site of infusion (see, for example, "Remington's
Pharmaceutical Sciences" 15th Edition, pages 1035-1038 and 1570-1580). Some
variation in dosage will necessarily occur depending on the condition of the subject being 
treated. The person responsible for administration will, in any event, determine the 
appropriate dose for the individual subject. 
B. Oral Administration 

In certain embodiments, active compounds may be administered orally. This is
contemplated for agents which are generally resistant, or have been rendered resistant, to
proteolysis by digestive enzymes. Such compounds are contemplated to include all those
compounds, or drugs, that are available in tablet form from the manufacturer and 
derivatives and analogues thereof. 

For oral administration, the active compounds may be administered, for example,
with an inert diluent or with an assimilable edible carrier, or they may be enclosed in a hard
or soft shell gelatin capsule, or compressed into tablets, or incorporated directly with the
food of the diet. For oral therapeutic administration, the active compounds may be
incorporated with excipients and used in the form of ingestible tablets, buccal tablets,
troches, capsules, elixirs, suspensions, syrups, wafers, and the like. Such compositions
and preparations should contain at least 0.1% of active compound. The percentage of the
compositions and preparations may, of course, be varied and may conveniently be
between about 2% and about 60% of the weight of the unit. The amount of active
compounds in such therapeutically useful compositions is such that a suitable dosage will
be obtained.

The tablets, troches, pills, capsules and the like may also contain the following: a
binder, such as gum tragacanth, acacia, cornstarch, or gelatin; excipients, such as dicalcium
phosphate; a disintegrating agent, such as corn starch, potato starch, alginic acid and the
like; a lubricant, such as magnesium stearate; and a sweetening agent, such as sucrose,
lactose or saccharin, or a flavoring agent, such as peppermint, oil of
wintergreen, or cherry flavoring. When the dosage unit form is a capsule, it may contain,
in addition to materials of the above type, a liquid carrier. Various other materials may
be present as coatings or to otherwise modify the physical form of the dosage unit. For
instance, tablets, pills, or capsules may be coated with shellac, sugar or both. A syrup or
elixir may contain the active compounds, sucrose as a sweetening agent, methyl and
propylparabens as preservatives, a dye and flavoring, such as cherry or orange flavor. Of
course, any material used in preparing any dosage unit form should be pharmaceutically
pure and substantially non-toxic in the amounts employed. In addition, the active
compounds may be incorporated into sustained-release preparations and formulations.

Upon formulation, the compounds will be administered in a manner compatible
with the dosage formulation and in such amount as is therapeutically effective. The
formulations are easily administered in a variety of dosage forms, such as those described
below in specific examples.
C. Liposomes 

In a particular embodiment, liposomal formulations are contemplated. Liposomal
encapsulation of pharmaceutical agents prolongs their half-lives when compared to
conventional drug delivery systems. Because larger quantities can be protectively
packaged, this allows the opportunity for dose-intensity of agents so delivered to cells.
This would be particularly attractive in the chemotherapy of cervical cancer if there were
mechanisms to specifically enhance the cellular targeting of such liposomes to these
cells.

"Liposome" is a generic term encompassing a variety of single and multilamellar 
lipid vehicles formed by the generation of enclosed lipid bilayers. Phospholipids are used
for preparing the liposomes according to the present invention and can carry a net
positive charge, a net negative charge or be neutral. Dicetyl phosphate can be employed
to confer a negative charge on the liposomes, and stearylamine can be used to confer a
positive charge on the liposomes. Liposomes are characterized by a phospholipid bilayer
membrane and an inner aqueous medium. Multilamellar liposomes have multiple lipid
layers separated by aqueous medium. They form spontaneously when phospholipids are
suspended in an excess of aqueous solution. The lipid components undergo
self-rearrangement before the formation of closed structures and entrap water and dissolved
solutes between the lipid bilayers (Ghosh and Bachhawat, 1991). Also contemplated are
cationic lipid-nucleic acid complexes, such as lipofectamine-nucleic acid complexes.

V. KITS 

Any of the compositions described herein may be comprised in a kit. In a
non-limiting example, reagents for determining the genotype of one or both UGT1A1 genes
are included in a kit. The kit may further include individual nucleic acids that can be
used to amplify and/or detect particular nucleic acid sequences of the UGT1A1 gene. It may also
include one or more buffers, such as a DNA isolation buffer, an amplification buffer or
a hybridization buffer. The kit may also contain compounds and reagents to prepare
DNA templates and isolate DNA from a sample. The kit may also include various
labeling reagents and compounds.

The components of the kits may be packaged either in aqueous media or in
lyophilized form. The container means of the kits will generally include at least one vial,
test tube, flask, bottle, syringe or other container means, into which a component may be
placed, and preferably, suitably aliquoted. Where there is more than one component in
the kit (labeling reagent and label may be packaged together), the kit also will generally
contain a second, third or other additional container into which the additional components
may be separately placed. However, various combinations of components may be
comprised in a vial. The kits of the present invention also will typically include a means
for containing the nucleic acids, and any other reagent containers in close confinement
for commercial sale. Such containers may include injection or blow-molded plastic
containers into which the desired vials are retained.

When the components of the kit are provided in one and/or more liquid solutions,
the liquid solution is an aqueous solution, with a sterile aqueous solution being
particularly preferred. However, the components of the kit may be provided as dried 
powder(s). When reagents and/or components are provided as a dry powder, the powder 
can be reconstituted by the addition of a suitable solvent. It is envisioned that the solvent 
may also be provided in another container means. 

A kit will also include instructions for employing the kit components as well as the
use of any other reagent not included in the kit. Instructions may include variations that
can be implemented.

It is contemplated that such reagents are embodiments of kits of the invention. 
Such kits, however, are not limited to the particular items identified above and may 
include any reagent used directly or indirectly in the detection of polymorphisms in the
UGT1A1 gene or the activity level of the UGT1A1 polypeptide. 

EXAMPLES 

The following examples are included to demonstrate preferred embodiments of
the invention. It should be appreciated by those of skill in the art that the techniques
disclosed in the examples which follow represent techniques discovered by the inventor
to function well in the practice of the invention, and thus can be considered to constitute
preferred modes for its practice. However, those of skill in the art should, in light of the
present disclosure, appreciate that many changes can be made in the specific
embodiments which are disclosed and still obtain a like or similar result without
departing from the spirit and scope of the invention.

EXAMPLE 1 

MATERIALS AND METHODS FOR EXAMPLES 2-5 
Chemicals and reagents 

Exonuclease I and shrimp alkaline phosphatase (exo/SAP) were purchased from
USB (Cleveland, Ohio, USA). The ABI Big Dye terminator cycle-sequencing kit was
purchased from Applied Biosystems (Foster City, California, USA). Primers for
amplification, sequencing of the PBREM, and amplification of the (TA)n polymorphism
were obtained from GibcoBRL (Invitrogen Co., Carlsbad, California, USA). SN-38 was
kindly provided by Dr Kiyoshi Terada (Yakult Honsha Co., Ltd, Japan). Camptothecin,
UDPGA, magnesium chloride, Trizma base, potassium monohydrogen phosphate and
1-heptanesulfonic acid were purchased from Sigma-Aldrich (St. Louis, Missouri, USA).
Acetonitrile, tetrahydrofuran and hydrochloric acid were obtained from Fisher Scientific
(Hanover, Illinois, USA).

Human livers 

Normal human livers (n = 83) were mainly obtained from the Liver Tissue
Procurement and Distribution System (National Institute of Diabetes and Digestive and
Kidney Diseases, Minneapolis, Minnesota). DNA was isolated by using the Qiagen
RNA/DNA Maxi Kit (Qiagen Inc., Valencia, California, USA), and microsomes were
isolated following differential centrifugation methods (Purba et al., 1987). DNA and
microsomes were provided by the Liver Core Bank Facility (St. Jude Children's Research
Hospital) of the Pharmacogenetics of Anticancer Agents Research (PAAR) Group. In
order to identify livers in which enzyme degradation occurred, liver samples consistently
comprised in the 10th percentile of UGT1A1, UGT1A9 and UGT2B7 activities were
sought. UGT1A9 and UGT2B7 activities were measured using specific probes (data not
shown) (Ramirez et al., 2002 and Innocenti et al., 2001). Out of eight samples within
the 10th percentile of UGT1A1 activity, only one sample was also within the 10th percentile
of the other two enzyme activities. If different handling/storage of the liver
or microsomal protein degradation occurred in that sample, this should not have affected
the degree of phenotype/genotype correlation because the individual had a 7/7 genotype,
and among the 7/7 genotype samples (n = 11), it had the 4th lowest value. Moreover, the lack
of correlation between UGT1A1 and UGT2B7 activities (n = 83, r = 0.07, P = 0.5) shows
that differences in tissue integrity and microsome stability probably have a mild influence
(if any) on the UGT phenotype.
The ethnic composition of the 83 liver donors comprised: Caucasians 68%,
African-Americans 18%, Asians 1%, others 2%. The percentage of samples of unknown
ethnic origin was 12%.

Genotyping of (TA)n polymorphism

In order to genotype the (TA)n polymorphism, approximately 40 ng of DNA was
subjected to amplification by polymerase chain reaction (PCR). The amplification
primers used have been previously described (Monaghan et al., 1996), where the
sequence of the forward primer is 5'-GTCACGTGACACAGTCAAAC-3' (SEQ ID
NO:2) and that of the reverse primer is 5'-TTTGCTCCTGCCAGAGGTT-3' (SEQ ID
NO:3). These primers flank the polymorphic TA locus in the promoter region of the
UGT1A1 gene and amplify a 98 bp fragment when a (TA)6 allele is present and a 100 bp
fragment when a (TA)7 allele is present. In the presence of (TA)5 and (TA)8 alleles, 96 bp
and 102 bp fragments are amplified, respectively. The reverse primer is labeled with a
fluorescent dye at its 5'-end to permit visualization of the amplification product. The amplification
reactions were performed in a 10 µl volume consisting of 1.5 mmol MgCl2, 250 mmol
dNTPs, 0.8 mmol of each primer and 0.5 U of Taq polymerase (Amplitaq Gold from
Applied Biosystems). The polymerase was activated at 95°C for 10 min and DNA
amplified for 35 cycles at 95°C for 30 sec, 55°C for 30 sec and 72°C for 30 sec, followed
by a final extension at 72°C for 10 min. Control DNAs from individuals known to have a
6/6, 6/7 and 7/7 genotype were included in the PCR analysis. PCR fragments were
subjected to gel electrophoresis on an ABI 377 DNA analyzer (Applied Biosystems).
Amplified products were diluted in a formamide and dextran blue loading buffer, and 1 µl was
combined with 1 µl of size standard (GS-350 from Applied Biosystems), denatured at
95°C, and loaded onto a 6% denaturing polyacrylamide gel. Electrophoresis was
performed for 3.5 hours following the manufacturer's recommendations. The Genescan and
Genotyper software (version 3.7, Applied Biosystems) was used to analyze fragments for
size determination.
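
The mapping from sized PCR fragments to (TA)n genotypes described above can be sketched as follows. This is a minimal illustration and not the Genescan/Genotyper procedure itself: the function name and the matching tolerance are assumptions introduced here, while the expected fragment lengths (96, 98, 100 and 102 bp for (TA)5 through (TA)8) are taken from the text.

```python
# Amplicon sizes stated in the text: 96, 98, 100 and 102 bp for (TA)5 to (TA)8.
FRAGMENT_BP_TO_TA_REPEATS = {96: 5, 98: 6, 100: 7, 102: 8}


def call_ta_genotype(peak_sizes_bp, tolerance_bp=0.5):
    """Return a genotype string such as '6/7' from one or two sized peaks.

    peak_sizes_bp: fragment sizes (bp) reported by the sizing software.
    tolerance_bp:  hypothetical window for matching a measured size to an
                   expected amplicon length; not a value from the source.
    """
    repeats = []
    for size in peak_sizes_bp:
        matches = [n for bp, n in FRAGMENT_BP_TO_TA_REPEATS.items()
                   if abs(size - bp) <= tolerance_bp]
        if len(matches) != 1:
            raise ValueError(f"peak at {size} bp matches no expected allele")
        repeats.append(matches[0])
    if len(repeats) == 1:
        # A single peak is read as a homozygote.
        repeats = repeats * 2
    if len(repeats) != 2:
        raise ValueError("expected one or two allele peaks")
    low, high = sorted(repeats)
    return f"{low}/{high}"
```

Because adjacent alleles differ by 2 bp, any tolerance below 1 bp keeps the match unambiguous; in practice the control DNAs of known 6/6, 6/7 and 7/7 genotype would anchor the calls.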

Sequencing of PBREM 

A 606 bp region (-3641 to -3036) including PBREM was successfully
PCR-amplified and sequenced in 81 of the 83 human liver DNAs and 22 of the 24 DNA
samples from African-American individuals (Americans of African descent, born in the
USA) included in the NIGMS HGCR Human Variation Panel (Coriell Institute for
Medical Research, Camden, New Jersey, USA). The reference sequence shown in Fig. 1
is that deposited in the GenBank database (accession number AF313454). Amplification
of the PCR product was performed in a 10 or 25-µl reaction volume using the following
primers: 5'-CTGGGGATAAACATGGGATG-3' SEQ ID NO:4 (forward) and
5'-CACCACCACTTCTGGAACCT-3' SEQ ID NO:5 (reverse). These primers were
designed using Primer3 software (Rozen et al., 1998). PCR conditions were 2 min at
94°C, 32 or 33 cycles of a three-step cycling program (30 sec at 94°C, 30 sec at 66.8°C
and 1 min at 72°C) and 72°C for 3 min. Following exo/SAP cleanup of the PCR product,
the amplicon was sequenced in the forward and reverse directions using the
amplification primers and Big Dye terminator chemistry, and run on an ABI 3700 (Applied
Biosystems) following the manufacturer's protocol. Sequences were analyzed and
individuals genotyped using the PolyPhred software (Nickerson et al., 1997). To
determine the ancestral state of the polymorphisms found in humans, the sequence was
compared to that of baboon (accession number AC091778).

SN-38 glucuronidation assay in human liver microsomes

Samples were phenotyped by using SN-38 as a substrate for UGT1A1. The
incubation mixture consisted of 5 nmol SN-38, 10 mmol MgCl2, 1 mg/ml microsomes,
0.025 mol Tris-HCl (pH 7.4) and 5 mmol UDP-GA. Samples were incubated for 30 min
at 37°C. The reaction was stopped by the addition of methanol. These conditions were
selected after previous optimization of the enzyme reaction (Iyer et al., 1998).
Camptothecin (75 ng) was used as an internal standard. SN-38 glucuronidation was
measured by HPLC (Hitachi Instruments Inc., San Jose, California, USA) with
fluorescence detection (λ excitation = 355 nm, λ emission = 515 nm). A µBondapak™
C18 column (3.9 × 300 mm, 10 µm; Waters Corp., Milford, Massachusetts, USA) and
µBondapak™ C18 guardpak (Waters Corp.) were used. A mobile phase of 8/4/88
acetonitrile/tetrahydrofuran/0.9 mmol sodium heptanesulfonic acid in 50 mmol potassium
dihydrogen phosphate (pH 4) was used during the first 7 min of the run. From 7.1-25
min, the eluent consisted of 30/70 acetonitrile/5 mmol sodium heptanesulfonic acid in 50
mmol potassium dihydrogen phosphate (pH 4). The flow rate was 0.9 ml/min. Retention
times for SN-38G, SN-38 and camptothecin were 13.3, 18.4 and 19.3 min, respectively.
SN-38 glucuronidation rates were reported as the ratios between SN-38 glucuronide
(SN-38G) and internal standard (IS) peak heights. The intra-assay variability was determined
by performing 10 incubations on the same day using a pool of human liver microsomes.
The inter-assay variability was evaluated by incubating a pool of human liver
microsomes in triplicate on three different days. The inter- and intra-assay variabilities
were within 7%.

Statistical analysis

The significance of linkage disequilibrium between pairs of polymorphic sites
was assessed using genotypic data and a likelihood ratio test provided in ARLEQUIN,
version 2 (Schneider et al., 2000). ARLEQUIN was also used to run a modified
Markov-chain random walk algorithm to test for Hardy-Weinberg equilibrium. Next, multisite
haplotypes were estimated using the program PHASE (Stephens et al., 2001). Because
this program does not accept both bi-allelic and multi-allelic polymorphic sites,
haplotypes were estimated only for individuals with either the (TA)6 or (TA)7 alleles.

Thirteen individuals were heterozygous for the (TA)5 or (TA)8 repeat, three of
whom were heterozygous only at the TA repeat and therefore unambiguous at the other
sites. For the remaining 10 individuals, haplotypes were determined manually by
assuming that the chromosome with the (TA)6 or (TA)7 allele contained a haplotype
previously identified by the PHASE analysis. In one case, this method would have
resulted in a new (TA)8 haplotype. However, it is more likely that this individual would
instead have a novel (TA)6 haplotype (V), which is consistent with the observation that
the (TA)6 allele is found on multiple haplotypes, including other rare ones. An incorrect
assignment would have little or no effect on the subsequent analyses because the novel
haplotype only occurs once out of 103 individuals and not in a sample used in studies of
correlation with phenotype.

The effective number of haplotypes was calculated as the reciprocal of the sum of
the squared haplotype frequencies. Diversity in (TA)6 haplotypes in Caucasians and
African-Americans, based on the numbers and frequencies of haplotypes and adjusted by sample
size, was estimated together with its SD using DnaSP version 3.53 (Rozas et al.). Statistical
significance was assessed using a t-test as previously described (Nei, 1987). The
chi-square test was used to analyze the differences in genotype/haplotype frequencies
between Caucasians and African-Americans.
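
The effective number of haplotypes defined in this paragraph, the reciprocal of the sum of squared haplotype frequencies, can be written out as a short sketch. The function name is an assumption made for illustration, and the frequencies are renormalized to sum to 1 so that rounded inputs do not bias the result; that renormalization is a choice made here, not something stated in the source.

```python
def effective_number_of_haplotypes(frequencies):
    """Reciprocal of the sum of squared haplotype frequencies.

    frequencies: iterable of non-negative haplotype frequencies (or counts).
    Values are renormalized to sum to 1 before squaring (an assumption made
    here so that rounded frequencies or raw counts can be passed in).
    """
    values = list(frequencies)
    total = sum(values)
    if total <= 0 or any(v < 0 for v in values):
        raise ValueError("frequencies must be non-negative and not all zero")
    return 1.0 / sum((v / total) ** 2 for v in values)
```

With k equally frequent haplotypes the value is exactly k, and it falls toward 1 as a single haplotype comes to dominate, which is why it summarizes how many relatively high-frequency haplotypes are present.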

UGT1A1 activity was phenotyped by measuring SN-38 glucuronidation rates of
each liver as the mean ± SD of a single experiment performed in triplicate. The statistical
analysis of the relationship between the (TA)n polymorphism and phenotype was planned
to assess first the genotype effect on phenotype in the population sample (n = 83) using
the analysis of variance (ANOVA). If the genotype effect was statistically significant,
then, within each ethnic group, a test of trend across the genotypes was performed using
the exact Jonckheere-Terpstra (JT) test (Gibbons et al., 1992). Pairwise comparisons
between two genotypes were performed using an exact one-sided Wilcoxon test.
Moreover, trend analysis and pairwise comparisons were performed in genotypes
expressed as the sum of TA repeats in both chromosomes (i.e. in samples with ≤12 (5/6,
6/6, 5/7), 13 (6/7) and ≥14 (7/7, 6/8, 7/8) TA repeat genotypes). Concerning the
haplotype-phenotype relationship, two-sided exact Wilcoxon tests were used to compare
the SN-38 glucuronidation rates between two haplotypes. SAS system (SAS Institute,
Inc., Cary, North Carolina) and StatXact-5 (CYTEL Software Corporation, Cambridge,
Massachusetts, USA) were used for statistical analysis. GraphPad software version 3.02
(GraphPad Software Inc., San Diego, California, USA) was used for graphical analysis.
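
The grouping of genotypes by the sum of TA repeats on both chromosomes can be illustrated with a small helper. It reads the group boundaries as "at most 12", "exactly 13" and "at least 14", which is what the genotypes listed for each group imply (5/6, 6/6 and 5/7 sum to 11 or 12; 7/7, 6/8 and 7/8 sum to 14 or 15). The function name and the string labels are assumptions for illustration only.

```python
def ta_repeat_sum_group(genotype):
    """Classify a genotype such as '6/7' by its total number of TA repeats.

    Returns '<=12', '13' or '>=14'. A 5/8 genotype also sums to 13 and would
    fall in the middle group, although the text lists only 6/7 there.
    """
    first, second = (int(part) for part in genotype.split("/"))
    total = first + second
    if total <= 12:
        return "<=12"
    if total == 13:
        return "13"
    return ">=14"
```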
EXAMPLE 2
GENOTYPING OF (TA)n POLYMORPHISM

The (TA)6 allele was the most common allele with a frequency of 0.58 while the
(TA)7 allele had a frequency of 0.36 (Table 1). (TA)5 and (TA)8 alleles were also found,
although at lower frequencies (0.02 and 0.05, respectively). In the population sample (n
= 107), the most common genotype was 6/7 (0.41), followed by the 6/6 genotype (0.34).
Rare genotypes (≤0.02) included 5/6, 5/7 and 5/8 genotypes. The (TA)6 and (TA)7 allele
frequencies were not significantly different between Caucasians and African-Americans
(chi-square test, P = 0.7). Similarly, 6/6, 6/7, and 7/7 genotype frequencies were not
different between the two ethnic groups (chi-square test, P = 0.8). One Asian individual
had a 6/6 genotype, while two individuals with other ethnicities had 6/7 and 7/7
genotypes.
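
As a consistency check, the allele frequencies quoted in this paragraph can be recovered from the population-sample genotype frequencies reported in Table 1, since each genotype contributes half of its frequency to each of its two alleles. The sketch below is illustrative only; the dictionary and function names are assumptions introduced here.

```python
from collections import defaultdict

# Population-sample genotype frequencies (n = 107) as reported in Table 1.
# The rounded values sum to 1.01; they are used here without renormalizing.
TABLE_1_POPULATION = {
    "5/6": 0.01, "5/7": 0.02, "5/8": 0.01, "6/6": 0.34,
    "6/7": 0.41, "6/8": 0.06, "7/7": 0.13, "7/8": 0.03,
}


def allele_frequencies(genotype_freqs):
    """Return {TA repeat number: allele frequency} from genotype frequencies."""
    totals = defaultdict(float)
    for genotype, freq in genotype_freqs.items():
        for allele in genotype.split("/"):
            # Each of the two alleles in a genotype carries half its frequency.
            totals[int(allele)] += freq / 2.0
    return dict(sorted(totals.items()))
```

Applied to `TABLE_1_POPULATION`, this reproduces the stated values of 0.02, 0.58, 0.36 and 0.05 for the (TA)5, (TA)6, (TA)7 and (TA)8 alleles after rounding to two decimals.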



Table 1: (TA)n polymorphism: genotype frequencies

(TA)n genotype              5/6    5/7    5/8    6/6    6/7    6/8    7/7    7/8
Population sample (n=107)   0.01   0.02   0.01   0.34   0.41   0.06   0.13   0.03
Caucasians (n=56)           0.02   0      0      0.38   0.46   0      0.13   0.02
African-Americans (n=39)    0      0.05   0.03   0.26   0.33   0.15   0.13   0.05

EXAMPLE 3 
SEQUENCING OF PBREM 

In 103 samples, six polymorphisms were found, and two of them (-3279G>T and
-3156G>A) are common, with frequencies of 0.39 and 0.30, respectively (FIG. 1, Table
2). All six polymorphisms are in Hardy-Weinberg equilibrium (P > 0.5). Based upon
comparisons to the baboon sequence (accession number AC091778, which is
incorporated herein by reference), it is likely that -3279G and -3156G are the ancestral
states. The most common polymorphism, -3279G>T, is located in the spacer sequence of
the NR3 domain of PBREM (FIG. 1). No variants were found in the gtNR1 domain, the
binding site for constitutive active receptor (CAR). -3279G was significantly more
common among African-Americans compared to Caucasians (chi squared = 13.82, P =
0.001) while the frequency of -3156A did not significantly differ between the two ethnic
groups (chi-square test, P = 0.9).
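
The variant allele frequencies quoted above follow from the genotype frequencies in Table 2, and the same numbers give the genotype proportions expected under Hardy-Weinberg equilibrium. The sketch below is a simplified illustration: the source tested equilibrium with a Markov-chain method in ARLEQUIN, whereas this code only compares observed proportions with p², 2pq and q². The function name is an assumption.

```python
def hardy_weinberg_expected(freq_hom_ref, freq_het, freq_hom_var):
    """Return (variant allele frequency q, expected (p^2, 2pq, q^2)).

    Inputs are observed genotype frequencies for a bi-allelic site; they are
    renormalized to sum to 1 to absorb rounding in published tables.
    """
    total = freq_hom_ref + freq_het + freq_hom_var
    if total <= 0:
        raise ValueError("genotype frequencies must sum to a positive value")
    q = (freq_hom_var + freq_het / 2.0) / total
    p = 1.0 - q
    return q, (p * p, 2.0 * p * q, q * q)
```

For the -3279 site in the population sample (GG 0.38, GT 0.46, TT 0.16) this gives q = 0.39 and expected proportions of about 0.37, 0.48 and 0.15, close to the observed values; for the -3156 site (GG 0.49, GA 0.43, AA 0.08) it gives q = 0.295, consistent with the reported frequency of approximately 0.30.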

Table 2: Sequencing of PBREM: genotype frequencies

Position   Genotype   Population sample   Caucasian   African-Americans
                      (n=103)             (n=55)      (n=37)
-3440      CC         0.96                0.96        0.95
-3440      CA         0.04                0.04        0.05
-3401      TT         0.99                1           0.97
-3401      TC         0.01                0           0.03
-3279      GG         0.38                0.18        0.73
-3279      GT         0.46                0.58        0.24
-3279      TT         0.16                0.24        0.03
-3177      CC         0.99                1           0.97
-3177      CG         0.01                0           0.03
-3175      AA         0.99                1           0.97
-3175      AG         0.01                0           0.03
-3156      GG         0.49                0.47        0.51
-3156      GA         0.43                0.44        0.41
-3156      AA         0.08                0.09        0.06

EXAMPLE 4
LINKAGE DISEQUILIBRIUM AND HAPLOTYPE STRUCTURE OF THE UGT1A1 PROMOTER

A likelihood ratio test detected significant pairwise linkage disequilibrium
between sites -3279, -3156 and the (TA)n polymorphism in our population sample (n =
103, P < 0.0001). When only the common (TA)6 and (TA)7 alleles were used for the
linkage disequilibrium analysis, the same results were obtained (P < 0.0001). When
pairwise linkage disequilibrium was separately assessed in Caucasians and
African-Americans, highly significant linkage disequilibrium was similarly detected in
Caucasians (P < 0.0001). In African-Americans, pairwise linkage disequilibrium was
also detected between all sites; however, the level of significance varied greatly between
the pairwise comparisons. Only linkage disequilibrium between (TA)n and -3156 had
significance levels similar to those seen for Caucasians (P < 0.0005) while linkage
disequilibrium had only low levels of significance between (TA)n and -3279 (P = 0.02)
and between -3279 and -3156 (P = 0.04).

Multisite haplotype inference resulted in 10 haplotypes spanning the PBREM
variants and the (TA)n polymorphism (Table 3). Haplotypes I-V include the (TA)6 allele,
and haplotype I differs from haplotype II at position -3279 in the NR3 domain of
PBREM. Haplotypes VI, VII and VIII include the (TA)7 repeat, and haplotypes VI and
VII differ from each other at position -3156. There is a suggestion that the haplotype
structure of the (TA)6 allele is different in the African-American subsample. Compared
to Caucasians, haplotype I is less common in African-Americans (chi squared = 27.06, P
< 0.0001), while haplotype II is more common (chi squared = 14.84, P = 0.0001).
Differences in haplotype VI and VII frequencies were not statistically significant between
the two groups (chi-square test, P = 0.44 and 0.48, respectively).

Among the samples examined, 21 different combinations of these haplotypes
were found. In Caucasians, the most frequent haplotype pairs are I/VI (0.35), I/I (0.24)
and I/II (0.11), while in African-Americans, they are I/II (0.11), II/VI (0.11), II/VIII
(0.08), I/VI (0.08), II/VII (0.08) and VI/VI (0.08). The effective numbers of haplotypes,
which reflect how many relatively high frequency haplotypes are observed, were 5.2 and
2.6 in African-Americans and Caucasians, respectively (Table 3). Finally, diversity (±
SD) of (TA)6 haplotypes was 0.555 ± 0.070 and 0.262 ± 0.065 in African-Americans and
Caucasians, respectively (P < 0.05).

EXAMPLE 5
UGT1A1 PHENOTYPING AND ASSOCIATION WITH (TA)n POLYMORPHISM AND HAPLOTYPES

UGT1A1 activity was measured as SN-38 glucuronidation rates in 83 human liver
microsomes. A 46% coefficient of variation (1.90 ± 0.87 SN-38G/IS, mean ± SD) and a
10-fold range in SN-38 glucuronidation were observed.
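
The coefficient of variation reported here is the SD expressed as a percentage of the mean. A minimal sketch of that arithmetic, with a function name chosen for illustration:

```python
def coefficient_of_variation_percent(mean, sd):
    """Return the coefficient of variation, 100 * SD / mean, in percent."""
    if mean == 0:
        raise ValueError("coefficient of variation is undefined for a zero mean")
    return 100.0 * sd / mean
```

With the reported mean of 1.90 and SD of 0.87 SN-38G/IS, this gives approximately 45.8%, which rounds to the stated 46%.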

Because of the small number of subjects in the 5/7, 5/6, 6/8 and 7/8 genotypes,
only 6/6, 6/7 and 7/7 were used in the ANOVA analysis. The phenotype was
significantly different across these three genotypes (P = 0.008) (FIG. 2a). The degree of
variation of the SN-38 glucuronidation rate across the genotypes was similar in different
ethnic groups (P > 0.1). A significantly decreasing trend was shown across the 6/6, 6/7
and 7/7 genotypes in Caucasians (P < 0.001, JT test, FIG. 2b) and across the 6/6, 6/7, 6/8
and 7/7 genotypes in African-Americans (P = 0.033, JT test) (FIG. 2c). When samples
with Asian (n = 1), other (n = 2) and unknown (n = 10) ethnic background were pooled
together, no significant trend could be found across (TA)n genotypes (P > 0.1, JT test)
(FIG. 2d). In the Caucasian sample, pairwise comparisons of the phenotype between two
genotype groups showed significant differences between the 6/7 and 7/7 (P = 0.007,
one-sided exact Wilcoxon test) and 6/6 and 7/7 groups (P = 0.0002). No pairwise
comparison was significant within African-Americans, probably due to the small number of
samples of each genotype.

When (TA)n genotypes were regarded as the sum of TA repeat number in both
chromosomes (i.e. ≤12 (5/6, 6/6, 5/7), 13 (6/7) and ≥14 (7/7, 6/8, 7/8) genotypes), a
significant trend of reduced UGT1A1 activity (P<0.01) was measured across the three
groups (the lowest being the ≥14 genotype group) in the whole sample population, in
Caucasians, and in African-Americans but not in samples with Asian/other/unknown ethnicity
(P = 0.66). Pairwise comparisons (one-sided exact Wilcoxon test) showed significantly
reduced UGT1A1 activity (P<0.01) in ≥14 compared to 13 and ≤12 genotypes, and in 13
compared to ≤12 genotypes in the whole sample population and in Caucasians. In
African-Americans, ≤12 genotypes had significantly higher UGT1A1 activity compared to
either 13 or ≥14 genotypes (P = 0.028 and 0.016, respectively), but UGT1A1 activity was
not significantly different between 13 and ≥14 genotypes (P = 0.11).

In samples of Caucasian and African origin, SN-38 glucuronidation rate varies
significantly across the haplotypes with a decreasing trend (P < 0.0001, JT test) (FIG. 3).
However, this apparent haplotype-phenotype correlation is likely to be due to the effect
of the (TA)n polymorphism that is in linkage disequilibrium with the PBREM variants.
Hence, the possible functional effects of the common -3279G>T and -3156G>A variants
were investigated by comparing the SN-38 glucuronidation rates across genotypes that
differed only by the variant being examined. Concerning the -3279G>T variant, SN-38
glucuronidation was reduced in I/II pairs compared to I/I pairs among Caucasians,
although without reaching statistical significance (2.06 ± 0.74 versus 2.53 ± 0.82
SN-38G/IS, respectively) (Wilcoxon rank sum test, P = 0.18). Concerning the -3156G>A
variant, although SN-38 glucuronidation is slightly reduced in I/VII compared to I/VI
pairs, the difference is not statistically significant (Wilcoxon rank sum test, P = 0.64).

EXAMPLE 6: 
MATERIALS AND METHODS FOR EXAMPLE 7 
Patient Selection

Patients with histologically confirmed solid tumors or lymphoma known to
respond to irinotecan or for which no therapy of proven benefit exists were eligible to
participate in this study. Other eligibility criteria included measurable disease by
radiologic imaging or physical examination; age of at least 18 years; Karnofsky
performance status of at least 70% (ambulatory and capable of self-care); and adequate
organ function defined as absolute neutrophil count (ANC) >1500 µl⁻¹, platelet count >
100,000 µl⁻¹, serum creatinine level <1.5 mg/dl or creatinine clearance >60 ml/min, AST
and ALT levels < 5 times the upper limit of normal, and conjugated bilirubin within
normal limits. Patients must have been off previous anticancer therapy, including
radiation therapy, for at least 4 weeks (6 weeks if the previous treatment included a
nitrosourea or mitomycin C) and off colony stimulating factor for at least 2 weeks.
Patients with a history of inflammatory bowel disease requiring therapy, chronic diarrheal
syndrome, paralytic ileus, or organ or stem cell transplant were excluded from the study.
Concurrent use of medications that may be substrates of the UGT1A1 enzyme or that
may be inducers or inhibitors of UGT1A1 activity was not permitted. Pregnant and
lactating women were also excluded from participation, and those with reproductive
potential were required to use an effective contraceptive method if sexually active.

Treatment Protocol 

Irinotecan was supplied by the National Cancer Institute (NCI) as an intravenous
solution with concentration 20 mg/ml in either 2 ml or 5 ml vials. The amount of
irinotecan to be administered was removed aseptically from the vial and added to 500 ml
of 0.9% saline or 5% dextrose injection, USP. Thirty minutes after pretreatment with 20
mg intravenous ondansetron, irinotecan 350 mg/m² was administered as a 90 minute
intravenous infusion once every 3 weeks, a standard dose and schedule. History,
physical examination, complete blood count (CBC) with differential, serum chemistry
profile (electrolytes, blood urea nitrogen, creatinine, glucose, albumin, alkaline
phosphatase, GGTP, AST, ALT, total and conjugated bilirubin, uric acid, and lactate
dehydrogenase), and coagulation profile (prothrombin time and partial thromboplastin
time) were conducted prior to first treatment. Thereafter, history, physical examination,
and toxicity assessment were conducted on day 1 of each cycle unless treatment-related
toxicity required more frequent follow up. CBC and serum chemistry profile were
obtained weekly throughout treatment, though CBCs were obtained 3 times per week
with the appearance of grade 3 or 4 neutropenia or thrombocytopenia. Toxicity
assessment was done according to the NCI common toxicity criteria, version 2.0
(website: ctep.cancer.gov). Objective tumor assessment by appropriate radiographic
imaging was performed prior to starting therapy and after every 2 cycles.
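
The arithmetic linking the 350 mg/m² dose to the 20 mg/ml stock solution can be sketched as follows. This is only an illustration of the unit conversion; the body surface area value and the function name are assumptions introduced for the example and do not appear in the protocol.

```python
def irinotecan_volume_ml(bsa_m2, dose_mg_per_m2=350.0, stock_mg_per_ml=20.0):
    """Return (total dose in mg, volume of stock solution in ml).

    bsa_m2 is the patient's body surface area in m^2 (hypothetical input).
    The defaults are the dose and stock concentration stated in the protocol.
    """
    if bsa_m2 <= 0:
        raise ValueError("body surface area must be positive")
    dose_mg = dose_mg_per_m2 * bsa_m2
    return dose_mg, dose_mg / stock_mg_per_ml


# Example with an assumed body surface area of 1.8 m^2.
dose, volume = irinotecan_volume_ml(1.8)
print(f"{dose:.0f} mg total dose, {volume:.1f} ml of 20 mg/ml stock")
```

The computed volume would then be added to the 500 ml diluent described above.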

Toxicity Management and Dose Modification 

For patients who experienced diarrhea, abdominal pain, or diaphoresis within 24
hours of irinotecan administration, 0.25 mg to 1 mg of intravenous atropine was
considered. Delayed diarrhea, defined as diarrhea occurring more than 24 hours after
irinotecan administration, was treated promptly with loperamide 4 mg at the onset and
then with 2 mg every 2 hours until the patient was diarrhea-free for at least 12 hours. For
patients who failed loperamide therapy, diphenoxylate, octreotide, and tincture of opium
were sequentially added as needed. Patients were instructed to aggressively hydrate
orally and were admitted to the hospital for intravenous electrolyte and fluid replacement
when necessary. A new course of therapy was not started until the ANC recovered to at
least 1500 µl⁻¹, the platelet count recovered to at least 100,000, and treatment-related
diarrhea fully resolved. Patients with grade 3 or 4 toxicities of any kind were dose-
reduced by 50 mg/m² for subsequent cycles.

Sample Collection

Prior to the first irinotecan infusion, venous blood (4.5 ml) for genotyping was
collected in purple top Vacutainer® tubes containing EDTA (Becton, Dickinson, and
Company, Franklin Lakes, NJ) and stored at -80°C for no more than 5 days prior to
analysis. Venous blood for pharmacokinetic analysis was collected on day 1 of cycle 1.
Samples of 7 ml were collected into green top sodium
heparinized Vacutainer® tubes prior to the infusion; 30, 60, and 90 minutes during the
infusion; and 10, 20, 30, 45, and 60 minutes and 1.5, 2, 4, 6, 12, and 24 hours after the
infusion. Samples were centrifuged (2500 rpm, 20 min, 4°C) and the plasma was
immediately separated, transferred as two aliquots into storage tubes, and frozen at -80°C
until analysis.

UGT1A1 Genotyping Assays

The variants typed in this study are listed in Table 4. The UGT1A1 (TA)nTAA
polymorphism was genotyped by PCR and product sizing as previously described (Te et
al, 2000). Alleles with 6 TA repeats resulted in a 98 bp fragment while alleles with 7
TA repeats resulted in a 100 bp fragment. Alleles with 5 TA and 8 TA repeats resulted in
96 bp and 102 bp fragments respectively. Alleles with 5, 6, 7, and 8 TA repeats are
reported as (TA)n and genotypes are assigned based upon the number of TA repeats in
each allele, i.e., 6/6, 6/7, 7/7, 6/8, et cetera.
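
The assignment of a genotype from the two sized PCR products can be sketched as below. The fragment sizes (96, 98, 100, and 102 bp for 5, 6, 7, and 8 repeats) are those stated above; the function names and the error handling are assumptions made for illustration, and a real sizing assay would also need a tolerance for measurement error, which is omitted here.

```python
REFERENCE_REPEATS = 6     # the 98 bp product corresponds to 6 TA repeats
REFERENCE_SIZE_BP = 98
BP_PER_REPEAT = 2         # each additional TA repeat adds 2 bp


def ta_repeats(fragment_bp):
    """Convert a PCR fragment size in bp to the number of TA repeats."""
    offset = fragment_bp - REFERENCE_SIZE_BP
    if offset % BP_PER_REPEAT != 0:
        raise ValueError(f"{fragment_bp} bp does not match a whole TA repeat")
    repeats = REFERENCE_REPEATS + offset // BP_PER_REPEAT
    if repeats not in (5, 6, 7, 8):
        raise ValueError(f"{fragment_bp} bp is outside the 5 to 8 repeat range")
    return repeats


def genotype(fragment_a_bp, fragment_b_bp):
    """Report a genotype such as '6/7' with the shorter allele listed first."""
    low, high = sorted((ta_repeats(fragment_a_bp), ta_repeats(fragment_b_bp)))
    return f"{low}/{high}"


print(genotype(98, 100))   # heterozygote with one 6-repeat and one 7-repeat allele
print(genotype(100, 100))  # homozygote for the 7-repeat allele
```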

The variants in the 5' upstream region (-3279G>T and -3156G>A) and in exon 1
[211G>A (G71R) and 686C>A (P229Q)] were genotyped by single base extension (SBE)
and separated on a denaturing high performance liquid chromatography (DHPLC) system
(Devaney et al, 2001). Genotyping of the -3279G>T and -3156G>A variants was
performed by PCR amplification of a 333 bp fragment in the UGT1A1 5' upstream
region that contains both variants. The PCR primers used were: 5'-ACC TCT AGT TAC
ATA ACC TGA A-3' (forward primer; SEQ ID NO:6) and 5'-AAT AAA CCC GAC
CTC ACC AC-3' (reverse primer; SEQ ID NO:7). PCRs were performed in a 15 µl
volume containing 125 nM each primer, 2.5 mM MgCl2, 50 µM each dNTP and 0.375 U
of AmpliTaq Gold polymerase (Applied Biosystems) in the buffer provided by the
manufacturer. PCR cycling conditions were for 40 cycles at 95°C for 15 s, 58°C for 15 s
and 72°C for 30 s in a 9600 thermal cycler (Applied Biosystems). PCR amplified
products were purified using shrimp alkaline phosphatase and exonuclease I by
incubating at 37°C for 45 min prior to the SBE reaction. SBE reactions were performed
in duplex for genotyping of both variants in 10 µl volumes containing 1 µM of extension
primer (5'-GCC AAG GGT AGA GTT CAG T-3' (SEQ ID NO:8) for -3279G>T and 5'-
GAC CCC AGC CCA CCT GTC-3' (SEQ ID NO:9) for -3156G>A), 250 µM each
ddNTP and 1.25 U thermosequenase (Amersham Pharmacia Biotech). Reactions were
cycled at 96°C for 30 s, 55°C for 30 s and 60°C for 30 s for 60 cycles. Separation of the
SBE products was performed on a WAVE 3500HT DHPLC system (Transgenomic Inc)
at 70°C after denaturation of the samples. The flow rate used was 1.5 ml/min and the run
time for each sample was 2.5 min. The gradient used for elution of the SBE products was
created by the software based on the length of the extended product and was adjusted
from 24% to 34% buffer B over 2 min (buffer B contains 25% acetonitrile). Extended
products were eluted in the order of C<G<T<A which is dependent on the hydrophobicity
differences of the four bases.
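
The elution gradient described above can be expressed as a simple linear ramp. The sketch below interpolates the percentage of buffer B over the 2 min gradient and converts it to an acetonitrile percentage. It assumes that buffer A contains no acetonitrile, which is not stated in the text, and the function names are hypothetical.

```python
def percent_b(t_min, start_b=24.0, end_b=34.0, duration_min=2.0):
    """Percent buffer B at time t_min for a linear gradient."""
    if not 0.0 <= t_min <= duration_min:
        raise ValueError("time is outside the gradient window")
    return start_b + (end_b - start_b) * t_min / duration_min


def acetonitrile_percent(pct_b, acn_in_b=25.0):
    """Acetonitrile percent in the mixed eluent, assuming none in buffer A."""
    return pct_b * acn_in_b / 100.0


for t in (0.0, 1.0, 2.0):
    b = percent_b(t)
    print(f"t={t:.1f} min: {b:.1f}% B, {acetonitrile_percent(b):.2f}% acetonitrile")
```

Under that assumption the ramp from 24% to 34% buffer B corresponds to a change of 5% buffer B per minute.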

Genotyping of the 211G>A and 686C>A exon 1 variants was performed by PCR
amplification of a 774 bp fragment that encompasses both variants. The PCR primers
used were: 5'-ATG CTG GGA AGA TAC TGT TG-3' (forward primer; SEQ ID NO:10)
and 5'-TTT GGT GAA GGC AGT TGA TT-3' (reverse primer; SEQ ID NO:11). PCRs
were performed in a 15 µl volume containing 125 nM each primer, 2.5 mM MgCl2, 100
µM each dNTP and 0.375 U of AmpliTaq Gold polymerase (Applied Biosystems) in the
buffer provided by the manufacturer. PCR cycling conditions were for 40 cycles at 95°C
for 15 s, 55°C for 15 s and 72°C for 45 s in a 9600 thermal cycler (Applied Biosystems).
PCR purification was performed as described above and the SBE reactions were
performed in 10 µl volumes containing 1 µM of each extension primer (5'-GTC TTC
AAG GTG TAA AAT GCT C-3' (SEQ ID NO:12) for 211G>A and 5'-GTG CGA
CGT GGT TTA TTC CC-3' (SEQ ID NO:13) for 686C>A) using the conditions
described above. For separation on the DHPLC system, a flow rate of 1.5 ml/min and a
run time of 3 min were used for each sample. The gradient used for elution of the SBE
products was created by the WAVE software based on the length of the extended product
and was adjusted from 25.6% to 38.1% buffer B over 2.5 min.

Pharmacokinetic Analysis 

Plasma concentrations of irinotecan and its metabolites were determined as 
previously published (Iyer et al, 2001). Pharmacokinetic parameters for irinotecan, SN- 
38, and SN-38G were calculated using standard non-compartmental methods with 
WinNonlin 2.0 (Pharsight Corporation, Mountain View, CA). The area under the plasma 
concentration-time curve (AUC) from time zero to the last measured concentration of 
irinotecan and metabolites was determined by the linear trapezoidal method. The 
glucuronidation ratio was expressed as the ratio of the SN-38G AUC over SN-38 AUC. 
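
The linear trapezoidal rule and the glucuronidation ratio defined above can be sketched as follows. This is a minimal illustration and not the WinNonlin implementation; the concentration-time values in the example are hypothetical and are not study data.

```python
def auc_linear_trapezoid(times_h, conc):
    """AUC from the first to the last time point by the linear trapezoidal rule.

    times_h: sampling times in hours, strictly increasing.
    conc: plasma concentrations (e.g. ng/ml) at those times.
    """
    if len(times_h) != len(conc) or len(times_h) < 2:
        raise ValueError("need at least two matched time/concentration points")
    total = 0.0
    for i in range(1, len(times_h)):
        dt = times_h[i] - times_h[i - 1]
        if dt <= 0:
            raise ValueError("times must be strictly increasing")
        # Area of the trapezoid between consecutive samples.
        total += dt * (conc[i] + conc[i - 1]) / 2.0
    return total


def glucuronidation_ratio(auc_sn38g, auc_sn38):
    """Ratio of the SN-38G AUC over the SN-38 AUC."""
    return auc_sn38g / auc_sn38


# Hypothetical profiles sampled at 0, 1.5, 4 and 24 h (illustration only).
times = [0.0, 1.5, 4.0, 24.0]
sn38 = [0.0, 30.0, 12.0, 2.0]
sn38g = [0.0, 90.0, 80.0, 10.0]
ratio = glucuronidation_ratio(auc_linear_trapezoid(times, sn38g),
                              auc_linear_trapezoid(times, sn38))
print(f"glucuronidation ratio: {ratio:.2f}")
```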

Statistical Analysis 

The study was originally designed to prospectively investigate the relationship
between genetic variation in the UGT1A1 promoter and grade 3-4 diarrhea. Results from
clinical trials using the 350 mg/m² every 3 weeks schedule suggested a 20 to 35%
frequency of diarrhea (ref). Based on previously published data, a single-gene Mendelian
model implied that 16% of patients would have the 7/7 genotype, 48% would have the
6/7 genotype, and 36% would have the 6/6 genotype. A sample size of 60 would have
had power of 0.8 at α=0.05 to detect a linear trend in the proportion of patients within
each genotype experiencing grade 3-4 diarrhea defined by 60% of 7/7 patients, 30% of
6/7 patients, and 10% of 6/6 patients.
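
The genotype proportions above are what Hardy-Weinberg proportions give for a TA7 allele frequency of 0.4. That frequency is inferred here from the 16% figure (the square root of 0.16) and is an assumption of this sketch rather than a value stated in the text. The same sketch combines the assumed genotype-specific diarrhea rates into an overall expected rate.

```python
def hardy_weinberg(q):
    """Expected genotype proportions for a biallelic locus with TA7 frequency q."""
    p = 1.0 - q
    return {"6/6": p * p, "6/7": 2.0 * p * q, "7/7": q * q}


# Design assumptions from the text: grade 3-4 diarrhea rate per genotype.
ASSUMED_DIARRHEA_RATE = {"6/6": 0.10, "6/7": 0.30, "7/7": 0.60}

proportions = hardy_weinberg(0.4)
overall = sum(proportions[g] * ASSUMED_DIARRHEA_RATE[g] for g in proportions)

for g, f in proportions.items():
    print(f"{g}: {f:.2f}")
print(f"overall expected grade 3-4 diarrhea rate: {overall:.3f}")
```

The overall rate implied by these design assumptions falls within the 20 to 35% range cited for the schedule.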

However, due to the lower than expected frequency of grade 3-4 diarrhea (see below),
the analyses were instead focused on the frequency of grade 4 neutropenia (ANC < 500
µl⁻¹). Nonparametric trend tests were used to investigate how the genotype is related to
pharmacokinetic parameters, pretreatment bilirubin levels and ANC nadir. The
relationship between genotype and grade 4 neutropenia was assessed by the use of
Fisher's exact test and calculation of the relative risks. Univariate regression analyses
were performed to identify the potential predictors of ANC nadir. They were performed
on the log scale for ANC to reduce skewness in the residuals. The pretreatment variables
were also considered jointly via analysis of covariance (ANCOVA) models in order to
identify the pretreatment measurements that can predict ln(ANC nadir). A different
ANCOVA model simultaneously considering the pre- and post-treatment variables was
used to explore the mechanism through which variability in UGT1A1 status might affect
the ANC nadir.

EXAMPLE 7:
Role of -3156G>A of UGT1A1 in Irinotecan Toxicity
Patient Characteristics

Sixty-six patients were enrolled in the study (Table 5). Blood was mistakenly not
drawn for DNA extraction in one patient, so genotype information is available for 65
patients. Sixty-three patients were assessable for toxicity as 3 patients (one 6/6, one 6/7,
one 7/8) missed scheduled blood tests and/or physician appointments. Sixty patients were
assessable for tumor response, as 6 of them were removed from the study before
radiological assessment of tumor response. All the patients received prior chemotherapy
regimens. Thirty-five of them received additional prior radiotherapy.

Allele and Genotype Frequencies 

The TA indel allele frequencies were: TA6=0.68, TA7=0.29, TA8=0.02,
TA5=0.01. The TA5 and TA8 alleles occurred exclusively in Black patients (one with 5/6,
two patients with 6/8, and one patient with 7/8 genotype). -3279T and -3156A alleles
had a frequency of 0.55 and 0.26, respectively.

Table 6 shows the frequencies of promoter haplotypes comprising -3279, -3156,
and the TA indel, based upon our previous publication on their linkage disequilibrium
(Innocenti et al., 2002). The frequency of the haplotype pairs is shown in Table 7. No
exon 1 variants (211G>A and 686C>A) were detected in this patient population.

Toxicity Prevalence, Relative Risk, Genetic Test. 

Diarrhea and neutropenia toxicities refer to events observed during cycle 1 of
treatment. The frequency of grade 4 neutropenia was 9.5%. Grade 4 neutropenia was
much more common in patients with genotype 7/7 (3/6, 50%) compared to patients with
6/7 genotype (3/24, 12.5%) and 6/6 genotype (0/30, 0%) (p=0.001, Fisher's exact test).
Nonparametric trend analysis revealed that the TA indel polymorphism is significantly
correlated to ln(ANC nadir) (7/7<6/7<6/6, z=-2.35, p = 0.02) (FIG. 4).

Because the -3156G>A variant distinguishes between two different haplotypes in
the TA7 individuals, the relative risk of grade 4 neutropenia was analyzed for the -3156
AA genotype (versus AG and GG combined) and 7/7 genotype (versus 5/6, 6/6, 6/7 and
6/8 combined). A higher relative risk was found in patients with -3156 AA genotype
(14.0, 95% CI 2.1-36.7) compared to patients with 7/7 genotype (9.3, 95% CI 1.7-40.7,
n=63). Moreover, the predictive power of a genetic test in patients receiving irinotecan
was evaluated for both the TA indel and the -3156 variant (Table 8). The predictive
power of either 7/7 or -3156 AA genotypes for grade 4 neutropenia was evaluated. In
addition, the predictive power of either 6/6 or -3156 GG genotypes was evaluated in
relation to the absence of grade 4 neutropenia (i.e., grade 0-3). In this comparison, the
two 6/8 patients were regarded as either 6/6 or 6/7 genotypes in order to assess whether
patients with the TA8 allele might be a confounding factor for the results of the genetic
test.
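
The sensitivity, specificity, and predictive values reported in Table 8 follow from a 2x2 table of genotype against outcome. The sketch below uses the 7/7 genotype as the test and grade 4 neutropenia as the outcome. The counts are inferred from figures given in this example (63 assessable patients, 3 of 6 patients with 7/7 and 3 of the remaining 57 patients with grade 4 neutropenia); that split is an assumption of the sketch, since the text does not tabulate it directly.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Sensitivity, specificity, PPV and NPV from 2x2 counts."""
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
    }


# 7/7 genotype as the test, grade 4 neutropenia as the outcome.
# tp: 7/7 with grade 4; fp: 7/7 without grade 4;
# fn: other genotypes with grade 4; tn: other genotypes without grade 4.
metrics = diagnostic_metrics(tp=3, fp=3, fn=3, tn=54)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

With these counts the four values round to the point estimates listed for the 7/7 genotype; the confidence intervals are not reproduced by this sketch.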

While this study was originally conceived to examine the relationship between
UGT1A1 genotype and severity of diarrhea, the frequency of grade 3 diarrhea in our
patients was only 5% (n=3), with no instances of grade 4 diarrhea. None of the three
patients with grade 3 diarrhea had the 6/6 genotype (two were 6/7 and one was 7/7).
Concerning the diarrhea events in patients with the TA8 allele (two 6/8 and one 7/8), only
a grade 1 event was reported in one 6/8 patient. The low frequency of severe diarrhea did
not allow any formal statistical analysis.

Total Bilirubin: Correlation with TA Indel Genotype and Toxicity 

Pretreatment total bilirubin levels were obtained in all patients (0.51±0.22 mg/dl,
mean±SD, n=66). As is shown in FIG. 5, total bilirubin levels were significantly
correlated with the TA indel polymorphism (nonparametric trend analysis, 7/7>6/7>6/6, z
= 2.88, p < 0.01). Total bilirubin levels were significantly higher in 7/7 patients
compared to 6/6 and 6/7 patients combined (0.80±0.29 and 0.48±0.19 mg/dl,
respectively, p = 0.0003). Concerning the distribution of the -3156 genotypes within
each TA indel genotype group, in the 6/7 genotype group, the three patients with GG
genotype had low bilirubin levels of 0.3-0.4 mg/dl. Similarly, the two patients with 6/8
and GG genotypes had low levels of bilirubin of 0.2-0.3 mg/dl. The one patient with GA
genotype in the 7/7 group had a bilirubin level of 0.6 mg/dl, which is in the low range for
this genotype group. The 7/8 patient did not have markedly elevated levels of total
bilirubin as would be expected if the TA8 allele resulted in decreased glucuronidation.

In addition, the -3156 and the TA indel variants were correlated with total
bilirubin by multiple regression analysis. The AA genotype showed a slightly better
correlation (r²=0.28, p<0.0001) compared to 7/7 genotype, either when the TA8 alleles
were regarded as TA6 (r²=0.23, p=0.002) or TA7 (r²=0.20, p=0.0009). The other common
variant -3279G>T had no significant association with total bilirubin.

Whether pre-treatment bilirubin would correlate with neutropenia was also
analyzed. Significantly higher bilirubin levels were observed in patients with grade 4
neutropenia (0.83±0.21 mg/dl) compared to those without grade 4 neutropenia (0.47±0.20
mg/dl) (p=0.0001) (FIG. 6). No cases of grade 4 neutropenia were reported in patients
with bilirubin levels less than 0.6 mg/dl. Of the 7 patients with total bilirubin higher
than 0.7 mg/dl, 4 had grade 4 neutropenia.

Correlation Between TA Indel Genotype and PK Parameters

Table 9 describes the pharmacokinetic parameters of irinotecan and its
metabolites stratified by 6/6, 6/7, and 7/7 genotypes. SN-38 AUC increased with
the number of TA7 alleles (nonparametric trend analysis, 7/7>6/7>6/6, z =
2.13, p = 0.03). Conversely, glucuronidation ratios (SN-38G/SN-38 AUC ratios)
decreased as the number of TA7 alleles increased (nonparametric trend analysis,
6/6>6/7>7/7, z = -2.16, p = 0.03). No significant trend was found for irinotecan and
SN-38G AUCs (p>0.05).

Regression Analysis

The impact of both pharmacokinetic variability and pre-treatment (including
genotype) variables on variability in neutropenia was also examined. Instead of the TA
indel genotype, the -3156 variant was used because 1) the -3156 genotype was better
correlated with the risk of grade 4 neutropenia and 2) -3156 better reflected the UGT1A1
status of patients, based upon the data on the correlation with total bilirubin. Univariate
regression analyses of ANC nadir selected SN-38 AUC, total bilirubin and -3156
genotype as the three best independent variables (Table 10). Gender showed a non-
significant correlation with ANC nadir but it was included in further modeling because of
possible gender differences in glucuronidation. Other variables did not show any
correlation.

Multivariate Analyses 

Several multivariate predictive ANCOVA models were considered to identify the
pretreatment measurements that predict ln(ANC nadir). The final model (r²=0.41) was
selected by backward elimination from Table 10 and is presented in Table 11. Pretreatment
bilirubin level is found to be very significant and negatively related to ln(ANC nadir).
Gender and -3156 genotype are found to be marginally significant after adjusting for the
total bilirubin level. Ln(ANC nadir) is found to have a lower value in women, and it
decreases with increasing number of (TA)7 alleles (6/6>6/7>7/7). Other factors, such as
ethnicity, number of prior regimens, performance status, and ln(pretreatment ANC) are
not found to be significant predictors of ln(ANC nadir) after adjusting for -3156
genotype, gender and total bilirubin.

After determining the predictive model using pre-treatment variables, the post-
treatment measurements of irinotecan AUC, SN-38 AUC, SN-38G AUC, and
glucuronidation ratio were added to the model as independent variables with the intention
of determining the possible mechanism of how the variability in UGT1A1 status affects
ln(ANC nadir). The final model selected through backward elimination (r²=0.5141)
which best predicts ln(ANC nadir) includes genotype and SN-38 AUC (p<0.001) (Table
12).

Toxic Death and Response

One toxic death was reported, as the patient died of neutropenia-related sepsis.
He was admitted to the hospital on day 7 of cycle 1 with fever and no neutrophils
detected (white blood cell count of 100 µl⁻¹). He was empirically treated with
ceftazidime, tobramycin, and fluconazole, though no infectious source was ever
identified. Despite support with granulocyte colony stimulating factor, the patient
remained neutropenic, became septic, and died on day 11. He had 7/7 genotype and the
highest level of pretreatment total bilirubin observed in these patients (1.2 mg/dl).

Concerning the response rates in this trial, three objective responses were
observed. Two patients achieved a partial response (one with colorectal and the other
with head and neck cancer) and had a 6/7 genotype. One colorectal cancer patient
achieved a complete response and had a 6/6 genotype.



Table 4 

UGT1A1 variants typed in this study. Positions indicated are from the first base of the 
UGT1A1 start site in the UGT1A cluster reference sequence (AF297093). 



Nucleotide change    Amino acid change    Exon
-3156G>A                                  Promoter
-3279G>T                                  Promoter
TA indel                                  Promoter
211G>A               G71R                 1
686C>A               P229Q                1



Table 5 
Patient characteristics 



                              No. of patients
Patients
  Entered                     66
  Assessable for toxicity     63
  Assessable for response     60
Sex
  Male                        39
  Female                      27
Age, median (range)           60 (34-85)
Ethnicity
  White                       50
  Black                       10
  Hispanic                    4
  Pacific Islander            1
  Asian                       1
Performance Status
  100%                        18
  90%                         31
  80%                         10
  70%                         17
Tumor type
  Colorectal                  10
  Gastroesophageal            14
  Head and Neck               5
  Liver                       2
  Lung                        19
  Pancreas                    3
  Unknown Primary             4
  Others                      9
Prior Radiotherapy            35

Table 6



Frequency of UGT1A1 promoter haplotypes. 



-3279G>T    -3156G>A    TA indel    Frequency
T           G           6           0.55
G           G           6           0.13
G           A           7           0.25
G           G           7           0.03
G           G           8           0.02
G           G           5           0.01



Table 7 
Frequency of haplotype pairs 



Haplotype pairs    Frequency
TG6/TG6            0.28
TG6/GA7            0.28
TG6/GG6            0.18
GA7/GA7            0.08
GG6/TA7            0.06
TG6/GG7            0.05
TG6/GG8            0.02
GG6/GG8            0.02
GG5/TG6            0.02
GA7/GG8            0.02



The haplotypes reflect the -3279, -3156, and TA indel variants: the first base refers
to the -3279 variant, the second base to the -3156 variant, and the number to
the number of TA repeats.

Table 8 



Genetic tests for the TA indel and -3156 genotypes 





Genotype, outcome          Sensitivity        Specificity        PPV                NPV
7/7, grade 4               0.50 (0.19-0.81)   0.95 (0.85-0.98)   0.50 (0.19-0.81)   0.95 (0.85-0.98)
-3156 AA, grade 4          0.50 (0.19-0.81)   0.96 (0.88-0.99)   0.60 (0.23-0.92)   0.95 (0.86-0.98)
6/6, grade 0-3, 6/8=6/6    0.57 (0.44-0.69)   1.00 (0.61-1.00)   1.00 (0.89-1.00)   0.20 (0.10-0.37)
6/6, grade 0-3, 6/8=6/7    0.54 (0.41-0.66)   1.00 (0.61-1.00)   1.00 (0.89-1.00)   0.19 (0.09-0.35)
-3156 GG, grade 0-3        0.63 (0.49-0.74)   1.00 (0.61-1.00)   1.00 (0.90-1.00)   0.22 (0.11-0.41)



PPV, positive predictive value.
NPV, negative predictive value.

Data are shown with 95% CI in parentheses. The patient with 5/6 genotype was regarded
as having a 6/6 genotype.

Table 9

Pharmacokinetic parameters by 6/6, 6/7, and 7/7 TA indel genotypes

TA indel    No. of      Irinotecan AUC       SN-38 AUC a      SN-38G AUC         Glucuronidation ratio b
genotype    patients    (ng*h/ml)            (ng*h/ml)        (ng*h/ml)          (SN-38G AUC/SN-38 AUC)
6/6         30          24412.8 (7691.6)     335.9 (167.7)    1954.2 (1361.1)    6.52 (3.98)
6/7         25          26085.5 (10814.2)    458.4 (379.8)    1887.9 (1682.5)    5.55 (4.79)
7/7         6           25432.9 (6694.9)     542.0 (195.3)    1819.1 (1249.8)    3.59 (2.81)

a 6/6<6/7<7/7, z=2.13, p=0.03, non-parametric trend analysis.
b 6/6>6/7>7/7, z=-2.16, p=0.03, non-parametric trend analysis.

Data expressed as mean (standard deviation).



Table 10 

Univariate analysis of ln(ANC nadir)

Independent Variable             r²        p-value
SN-38 AUC                        0.3523    <0.0001
Pre-treatment total bilirubin    0.2979    <0.0001
-3156 genotype                   0.2413    0.0003
Irinotecan AUC                   0.1273    0.0041
Glucuronidation ratio            0.1171    0.0060
Gender                           0.0445    0.0971
SN-38G AUC                       0.0411    0.1109
Age >70                          0.0242    0.2231
White ethnicity                  0.0128    0.3764
Ln(pre-treatment ANC)            0.0000    0.9749
Performance status               0.0016    0.9923

Table 11

ANCOVA for the final predictive model of ln(ANC nadir) using pre-treatment
variables

                       Coefficient    SE        p-value
Intercept              8.1885         0.2767    <0.001
Genotype
  AA vs. GG+GA         -0.9401        0.3986    0.022
Gender
  Males vs. Females    0.4323         0.2001    0.035
Total Bilirubin        -1.8452        0.4816    <0.001

SE, standard error

The overall model shows an r² value of 0.4048 (p<0.0001).
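
How the coefficients of Table 11 combine into a predicted value can be sketched as below. The coefficients are those of the table; the coding of the indicator variables (1 for the -3156 AA genotype, 1 for males, 0 otherwise), the use of mg/dl for bilirubin, and the function names are assumptions of this sketch, since the table does not state the coding explicitly. It illustrates the model form only.

```python
import math

# Coefficients from Table 11.
INTERCEPT = 8.1885
COEF_AA = -0.9401         # -3156 AA vs. GG+GA
COEF_MALE = 0.4323        # males vs. females
COEF_BILIRUBIN = -1.8452  # per unit of pre-treatment total bilirubin


def predicted_ln_anc_nadir(is_aa, is_male, bilirubin_mg_dl):
    """Linear predictor for ln(ANC nadir) under the assumed 0/1 coding."""
    return (INTERCEPT
            + COEF_AA * int(is_aa)
            + COEF_MALE * int(is_male)
            + COEF_BILIRUBIN * bilirubin_mg_dl)


def predicted_anc_nadir(is_aa, is_male, bilirubin_mg_dl):
    """Back-transform the linear predictor from the log scale."""
    return math.exp(predicted_ln_anc_nadir(is_aa, is_male, bilirubin_mg_dl))


# Hypothetical patients: same sex and bilirubin, differing only in genotype.
for is_aa in (False, True):
    value = predicted_anc_nadir(is_aa, is_male=False, bilirubin_mg_dl=0.5)
    print(f"AA={is_aa}: predicted ANC nadir {value:.0f}")
```

Because the model is fitted on the log scale, each coefficient acts multiplicatively on the predicted ANC nadir after back-transformation.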



Table 12

ANCOVA for the final predictive model of ln(ANC nadir) using pre-treatment and
post-treatment variables

                     Coefficient    SE        p-value
Intercept            8.3111         0.1517    <0.001
Genotype
  AA vs. GG+GA       -1.3798        0.3234    <0.001
SN-38 AUC            -0.0019        0.0003    <0.001

SE, standard error

The overall model shows an r² value of 0.5128 (p<0.0001).



All of the compositions and/or methods disclosed and claimed herein can be made
and executed without undue experimentation in light of the present disclosure. While the
compositions and methods of this invention have been described in terms of preferred
embodiments, it will be apparent to those of skill in the art that variations may be applied
to the compositions and/or methods and in the steps or in the sequence of steps of the
method described herein without departing from the concept, spirit and scope of the
invention. More specifically, it will be apparent that certain agents that are both
chemically and physiologically related may be substituted for the agents described herein
while the same or similar results would be achieved. All such similar substitutes and
modifications apparent to those skilled in the art are deemed to be within the spirit, scope
and concept of the invention as defined by the appended claims.